ZINC FINGER PROTEINS FOR
DNA BINDING AND GENE REGULATION IN PLANTS 

TECHNICAL FIELD 
The methods and compositions disclosed herein relate generally to the field of 
regulation of gene expression and specifically to methods of modulating gene expression 
in plants by utilizing polypeptides derived from plant zinc finger-nucleotide binding
proteins. 

BACKGROUND 

Zinc finger proteins (ZFPs) are proteins that bind to DNA, RNA and/or protein, in 
a sequence-specific manner, by virtue of a metal stabilized domain known as a zinc 
finger. See, for example, Miller et al. (1985) EMBO J. 4:1609-1614; Rhodes et al.
(1993) Sci. Amer. Feb:56-65; and Klug (1999) J. Mol. Biol. 293:215-218. There are at
least two classes of ZFPs which coordinate zinc to form a compact DNA-binding domain.
Each class can be distinguished by the identities of the conserved metal-binding amino 
acids and by the associated architecture of the DNA-binding domain. 

The most widely represented class of ZFPs, known as the C2H2 ZFPs, comprises 
proteins that are composed of zinc fingers that contain two conserved cysteine residues 
and two conserved histidine residues. Over 10,000 C2H2 zinc fingers have been
identified in several thousand known or putative transcription factors. Each C2H2 zinc
finger domain comprises a conserved sequence of approximately 30 amino acids that
contains the invariant cysteines and histidines in the following arrangement:
-Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His (SEQ ID NO: 1). In animal genomes, polynucleotide sequences
encoding this conserved amino acid sequence motif are usually found as a series of
tandem duplications, leading to the formation of multi-finger domains within a particular
transcription factor. 
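
The conserved arrangement above can be expressed as a search pattern over a protein sequence. The following is a minimal sketch, assuming one-letter amino acid codes and reading (X)n as n arbitrary residues; the function name and the example sequence are hypothetical and are not taken from any real protein.

```python
import re

# Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His, written with one-letter amino acid codes.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_c2h2_fingers(protein):
    """Return (start, end, matched_text) for each non-overlapping motif match."""
    return [(m.start(), m.end(), m.group()) for m in C2H2_PATTERN.finditer(protein)]


# Hypothetical two-finger sequence, invented for illustration only.
toy = "MKYKCPECGKSFSQKSNLVRHQRTHTGEKPYACPVCGKAFTRSDHLTKHMRIHSS"
for start, end, text in find_c2h2_fingers(toy):
    print(start, end, text)
```

A pattern of this kind only flags candidate fingers by the spacing of the metal-binding residues; it says nothing about whether a candidate actually folds or binds DNA.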

Several structural studies have demonstrated that the conserved C2H2 amino acid
motif folds into a beta turn (containing the two invariant cysteine residues) and an alpha
helix (containing the two invariant histidine residues). The alpha helix and beta turn
associate along a hydrophobic interface and are held together through the tetrahedral
coordination of a zinc atom by the conserved cysteines and histidines.

The three-dimensional structure of a complex between a DNA target site and a
polypeptide comprising three C2H2 zinc fingers derived from the mouse immediate early
protein zif268 (also known as Krox-24) has been determined by x-ray crystallography.
Pavletich et al. (1991) Science 252:809-817. The structure reveals that the amino acid
side chains on each zinc finger alpha helix interact specifically with functional groups of
the nucleotide bases exposed in the DNA major groove. Each finger interacts with DNA
as a module; changes in the sequence of amino acids of the recognition helix can result in
corresponding changes in target site specificity. See, for example, Wolfe et al. (1999)
Annu. Rev. Biophys. Biomol. Struct. 3:183-212.

Another class of ZFPs includes the so-called C3H ZFPs. See, e.g., Jiang et al.
(1996) J. Biol. Chem. 271:10723-10730 for a discussion of Cys-Cys-His-Cys (C3H)
ZFPs. 

The modular nature of sequence-specific interactions between zinc fingers and 
DNA sequences (i.e., a particular zinc finger of defined sequence binds to a DNA triplet 
or quadruplet of defined sequence) allows certain DNA-binding domains of 
predetermined specificity to be designed and/or selected. See, for example, Blackburn
(2000) Curr. Opin. Struct. Biol. 10:399-400; Segal et al. (2000) Curr. Opin. Chem. Biol.
4:34-39. To this end, numerous modifications of animal C2H2 zinc finger proteins, most
often either mouse zif268 or human SP-1, have been reported. See, e.g., U.S. Patent
Nos. 6,007,988; 6,013,453; 6,140,081; 6,140,466; GB Patent No. 2,348,424;
PCT WO98/53057; PCT WO98/53058; PCT WO98/53059; PCT WO98/53060;
PCT WO98/54311; PCT WO00/23464; PCT WO 00/42219; Choo et al. (2000) Curr.
Opin. Struct. Biol. 10:411-416; Segal et al. (2000) supra; and references cited in these
publications. The results of these and other studies are generally consistent with the idea
that it is possible to obtain C2H2 ZFPs, based on, for example, the mouse zif268 ZFP or
the human SP-1 ZFP, of desired target site specificity. Such target-specific ZFPs are
generally obtained by selection or design of individual fingers, each of which has a 3-4
nucleotide target specificity, and assembly of such fingers into a ZFP having a target site
specificity of 9-20 nucleotides. 

C2H2 ZFPs have been identified in plants, where they are involved in, for
example, developmental regulation of various floral and vegetative organs. See, e.g.,
Takatsuji (1999) Plant Mol. Biol. 39:1073-1078. In plant ZFPs, however, zinc fingers do
not generally occur in closely-spaced tandem arrays. For example, in a family of DNA
binding proteins identified in Petunia (the EPF family), two canonical Cys2-His2 zinc
finger motifs are separated by an intervening stretch of between 19 and 232 amino acids.
The binding capability of this class of proteins appears to be determined by both the zinc
fingers and the intervening amino acids, suggesting that plant zinc finger proteins have a
different mechanism of DNA binding than do the zif268 and SP-1 zinc finger proteins, for
example. In addition, the sequence specificity of DNA binding by EPF-type plant ZFPs
is dependent upon different positions in the recognition helix of the zinc finger than is the
specificity of DNA binding by most zif268-type ZFPs. See, for example, Takatsuji
(1996) Biochem. Biophys. Res. Comm. 224:219-223.

Targeted gene regulation in plants would facilitate numerous applications such as,
for example, the optimization of crop traits affecting nutritional value, yield, stress
tolerance, pathogen resistance, and resistance to agrochemicals. In addition, targeted
gene regulation could be used to study gene function in plants, and to adapt plants for use
as biological factories for the production of pharmaceutical compounds or industrial
chemicals. Such regulation could theoretically be achieved by design of plant
transcriptional regulatory proteins of predetermined DNA sequence specificity.
However, to date, naturally occurring plant ZFPs that recognize DNA by using a tandem
arrangement of modular zinc finger binding domains (as do zif268 and related ZFPs)
have not been described. Therefore, it remains difficult, if not impossible, to design a
plant ZFP capable of recognizing and binding to a particular predetermined nucleotide
sequence. Furthermore, since the mechanism of DNA binding by plant ZFPs remains
largely unknown, no immediate solution to this problem is apparent. Accordingly, the
ability to design and/or select plant zinc finger proteins of predetermined target
specificity would be desirable.

SUMMARY

The present disclosure provides plant DNA-binding proteins that are modified in such
a way that their mechanism and specificity of DNA binding are determined by tandem arrays
of modular zinc finger binding units. In this way, design strategies and selection methods
which have been developed and utilized for other classes of ZFPs can be applied to the 
production of plant ZFPs having a predetermined target site specificity, for use in modulation 
of gene expression in plant cells. 

In one aspect, disclosed herein is a modified plant zinc finger protein (ZFP) that binds 
to a target sequence. The target sequence can be, for example, nucleic acid (DNA or RNA) or 
amino acids of any length, for instance 3 or more contiguous nucleotides. In certain 
embodiments, the modified plant ZFP comprises a tandem array of zinc fingers. One, more 
than one or all of the zinc fingers of the ZFP may be naturally occurring or may be obtained
by rational design and/or selection (e.g., phage display, interaction trap, ribosome display and
RNA-peptide fusion). Thus, in certain embodiments, one or more of the zinc fingers comprise
canonical C2H2 zinc fingers and in other embodiments, one or more of the zinc fingers
comprise non-canonical zinc fingers. In any of the modified plant ZFPs described herein, one
or more of the zinc fingers are derived from two or more plant species, for example, by
deleting and/or substituting one or more amino acid residues as compared to a naturally 
occurring plant ZFP. In certain embodiments, one or more amino acid residues are deleted 
between one or more of the zinc fingers. 

Thus, in one embodiment, plant zinc finger proteins (ZFPs) are modified, for example, 
by deletion of inter-zinc finger sequences and/or insertion of additional zinc finger sequences, 
to generate one or more tandem arrays of zinc fingers. Thus, in contrast to naturally occurring 
plant zinc finger proteins, their mechanism and specificity of DNA binding are determined by 
tandem arrays of modular zinc finger binding units. In another embodiment, plant zinc fingers 
of disparate origin (e.g., zinc fingers from Petunia and Arabidopsis) are recombined into a
tandem array of modular zinc finger binding units.

In yet another aspect, a fusion polypeptide comprising (i) a modified plant ZFP as
described herein and (ii) at least one functional domain is described. The functional domain
may be a repressive domain or an activation domain.

In yet another aspect, isolated polynucleotides encoding any of the modified plant zinc
finger proteins or fusion polypeptides described herein are provided. Also provided are
expression vectors comprising these polynucleotides. Also described are host cells comprising
these polynucleotides and/or expression vectors.

In another aspect, a method for modulating gene expression in a plant cell comprising
contacting the cell with any of the modified plant ZFPs described herein is provided. In one
embodiment, the protein comprising a tandem array of zinc fingers is provided. Preferably,
the protein is expressed in the cell, for example, by introducing the protein and/or a nucleic
acid encoding the protein into the cell. In certain embodiments, the zinc fingers of the protein
comprise an adapted amino acid sequence at any one or more of residues -1 through +6 of the
recognition helix. The adapted amino acid sequence can be obtained by rational design and/or
by selection (e.g., using methods such as phage display, interaction trap, ribosome display,
RNA-peptide fusion or combinations of one or more of these methods). In certain
embodiments, the protein comprises zinc finger backbones from different species, for example
different plant species. In other embodiments, the protein comprises zinc finger backbones of
plant origin, fungal origin or combinations thereof. Furthermore, in certain embodiments, the
protein is obtained by deletion of inter-finger sequences from a plant ZFP.

In other aspects, the methods described herein make use of a fusion protein comprising
a tandem array of zinc fingers and one or more functional domains, for example, one or more
transcriptional activation (e.g., C1, etc.) or repression domains.

In other aspects, the compositions and methods described herein find use in a variety 
of applications in which modulation of gene expression alters the phenotype and/or
composition of the plant or plant cell, for example by optimizing crop traits such as nutritional
value, yield, stress tolerance, pathogen resistance, resistance to agrochemicals (e.g.,
insecticides and/or herbicides) and the like; and by adapting plants for use in production of
pharmaceutical compounds and/or industrial chemicals. In certain embodiments, the
modulation of gene expression can be used to study genetic pathways and/or gene functions in 
plants. 

These and other embodiments will readily occur to those of skill in the art in light 
of the disclosure herein. 






WO 02/057294



PCT/US02/01906



BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a schematic depicting construction of the YCF3 expression vector 
useful in expressing modified plant ZFPs. 

Figure 2 shows the results of analysis of GMT mRNA in RNA isolated from 
Arabidopsis thaliana protoplasts that had been transfected with constructs encoding 
fusions of a transcriptional activation domain with various modified plant ZFPs. Results
are expressed as GMT mRNA normalized to 18S rRNA. AGMT numbers on the
abscissa refer to the modified plant ZFP binding domains shown in Table 2. Duplicate
TaqMan® analyses are shown for each RNA sample.
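
The normalization described for Figure 2 amounts to dividing the target mRNA signal by the reference rRNA signal for each assay. The sketch below illustrates that arithmetic only; the function names are hypothetical, the numbers are invented and are not data from the figure, and averaging the duplicates is an assumption made here for illustration (the figure itself shows both duplicates).

```python
from statistics import mean


def normalized_expression(target_signal, reference_signal):
    """Target mRNA signal divided by the reference (e.g. 18S rRNA) signal."""
    if reference_signal <= 0:
        raise ValueError("reference signal must be positive")
    return target_signal / reference_signal


def summarize_duplicates(measurements):
    """Average the normalized values of duplicate assays for one RNA sample.

    `measurements` is a list of (target_signal, reference_signal) pairs.
    """
    return mean(normalized_expression(t, r) for t, r in measurements)


# Hypothetical signal values in arbitrary units; not data from Figure 2.
control = summarize_duplicates([(10.0, 100.0), (12.0, 100.0)])
treated = summarize_duplicates([(30.0, 100.0), (36.0, 120.0)])
print(treated / control)
```

Normalizing to a reference transcript corrects for differences in the amount of RNA loaded per assay, so samples are compared on the ratio and not on the raw target signal.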

DETAILED DESCRIPTION

General 

The present disclosure provides modified plant ZFPs (and functional fragments 
thereof), wherein zinc fingers are arranged in one or more tandem arrays such that, upon
DNA binding, each zinc finger contacts a triplet or quadruplet target subsite. In preferred
embodiments, the target subsites are contiguous to one another. The modified plant ZFP
can be a fusion polypeptide and, either by itself or as part of such a fusion, can enhance
or suppress expression of a gene (i.e., modulate gene expression). Polynucleotides
encoding modified plant ZFPs, and polynucleotides encoding fusion proteins comprising
one or more modified plant ZFPs are also provided. Additionally provided are
compositions comprising, in combination with an acceptable carrier, any of the modified
plant zinc finger binding polypeptides described herein or functional fragments thereof;
and compositions comprising a nucleotide sequence that encodes a modified plant zinc
finger binding polypeptide or functional fragment thereof, wherein the modified plant
zinc finger polypeptide or functional fragment thereof binds to a cellular nucleotide 
sequence to modulate the function of the cellular nucleotide sequence. 

Currently, ZFPs targeted to specific predetermined sequences are derived from
non-plant ZFPs such as Xenopus TFIIIA, murine zif268, human SP-1 and the like.
Accordingly, in one embodiment, modified plant zinc finger proteins, targeted to
predetermined sequences, are described wherein all or substantially all of the sequences
making up the ZFP are derived from one or more plant sources. Furthermore, the
modified plant ZFPs are organized in non-plant ZFP structures, for example structures in
which individual zinc fingers (e.g., C2H2 fingers) are linked by short linker sequences, or
structures that do not contain native plant DNA binding sequences such as inter-zinc
finger sequences of a plant zinc finger protein (which can be generated from plant ZFPs,
for example, by deletion of inter-zinc finger amino acid sequences). In certain
embodiments, all amino acid residues of a modified plant ZFP are derived from a
non-modified plant ZFP (e.g., when a modified plant ZFP is obtained by deletion of
inter-finger sequences from a non-modified plant ZFP). In other embodiments, one or more
amino acid residues of a modified plant ZFP (e.g., amino acids involved in sequence- 
specific and/or non-specific DNA contacts) can be either designed or selected, and thus 
may not constitute part of the original plant ZFP sequence. 
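
As a rough illustration of reorganizing plant fingers into a tandem array by deleting inter-finger sequence, the sketch below locates finger cores with the C2H2 spacing pattern and rejoins them with a short linker. It is a sketch under stated assumptions: the linker `TGEKP`, the function name, and the sequence are hypothetical choices made here, the sequence is invented, and a real design would also need to retain the residues flanking each core.

```python
import re

# Core C2H2 motif: Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His (one-letter codes).
C2H2_CORE = re.compile(r"C.{2,4}C.{12}H.{3,5}H")

# ASSUMPTION: a five-residue linker, chosen only for illustration.
HYPOTHETICAL_LINKER = "TGEKP"


def collapse_to_tandem_array(protein, linker=HYPOTHETICAL_LINKER):
    """Join the finger cores found in `protein` with a short linker.

    Returns (tandem_array, spacer_lengths), where spacer_lengths lists the
    number of residues that originally separated consecutive finger cores.
    """
    matches = list(C2H2_CORE.finditer(protein))
    spacers = [b.start() - a.end() for a, b in zip(matches, matches[1:])]
    return linker.join(m.group() for m in matches), spacers


# Invented sequence: two finger cores separated by a 40-residue spacer.
finger_a = "CPECGKSFSQKSNLVRHQRTH"
finger_b = "CPVCGKAFTRSDHLTKHMRIH"
native_like = "MK" + finger_a + "S" * 40 + finger_b + "GG"

array, spacers = collapse_to_tandem_array(native_like)
print(spacers)  # [40]
print(array)
```

Reporting the original spacer lengths alongside the collapsed array makes it easy to see how much inter-finger sequence was removed.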

It is preferred that a modified plant zinc finger protein be a multi-finger protein, for 
example comprising at least three zinc-coordinating fingers. In the standard nomenclature for
ZFPs, the "first" finger is the N-terminal-most finger of the protein (with respect to the other
fingers) and binds to the 3'-most triplet (or quadruplet) subsite in the target site. Additional
fingers, moving towards the C-terminus of the protein, are numbered sequentially. 
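
The numbering convention just described can be made concrete with a small sketch that maps finger numbers to subsites. It assumes non-overlapping subsites of equal length and a target written 5' to 3'; the function name and the nine-base target are hypothetical.

```python
def assign_subsites(target_site, subsite_len=3):
    """Map finger number -> subsite for a target site written 5'->3'.

    Finger 1 (the N-terminal-most finger) is paired with the 3'-most
    subsite; later fingers move toward the 5' end of the target.
    """
    if subsite_len <= 0 or len(target_site) % subsite_len != 0:
        raise ValueError("target length must be a multiple of the subsite length")
    subsites = [
        target_site[i:i + subsite_len]
        for i in range(0, len(target_site), subsite_len)
    ]
    return dict(enumerate(reversed(subsites), start=1))


# Hypothetical nine-base target for a three-finger protein.
print(assign_subsites("AAACCCGGG"))  # {1: 'GGG', 2: 'CCC', 3: 'AAA'}
```

The reversal reflects the antiparallel arrangement implied by the nomenclature: the protein runs N to C while the contacted subsites run 3' to 5'.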

In other embodiments, one or more of the component fingers of the modified plant 
ZFP will be a non-C2H2 structure. For example, in certain embodiments, a three-finger zinc
finger protein is provided wherein the first two fingers are of the C2H2 class but the third
finger is non-C2H2 (e.g., C3H or other structure) as described, for example, in International
Publication entitled "Modified Zinc Finger Proteins," filed even date herewith (Attorney
docket No. 8325-0025.40).

Therefore, the modified plant ZFPs disclosed herein differ from previously described
designed zinc finger protein transcription factors in that they are entirely or primarily
composed of plant sequences. Nonetheless, the plant sequences are assembled such that the
overall structure of the binding region of the modified plant protein is similar to that of a
non-plant eukaryotic zinc finger. Thus, modified plant ZFPs, as disclosed herein, comprise plant
sequences either for the entire ZFP or for most of the ZFP. In the latter case, plant sequences
are used preferably in all regions except those residues involved in recognition and/or binding
to the target site, which can comprise, for example, sequences obtained by rational design
and/or selection. 

It will be readily apparent that various combinations of zinc fingers can be used in a
single modified plant ZFP. For example, all of the finger components can be designed (i.e.,
their sequences are obtained as a result of rational design methods); all of the finger
components can be selected (i.e., their sequences are obtained by a selection method such as,
e.g., phage display, two-hybrid systems or interaction trap assays); all of the finger
components can be naturally-occurring plant zinc fingers; or the component fingers of a 
modified plant ZFP can be any combination of naturally-occurring plant zinc fingers, 
designed fingers and selected fingers. 

In additional embodiments, the modified plant zinc finger proteins described herein
(and/or functional fragments thereof) are used in fusion proteins, for example fusions of a
modified plant ZFP DNA-binding domain with, e.g., a repression domain, an activation
domain, a chromatin remodeling domain, a component of a chromatin remodeling complex, a
methyl-binding domain, a methyltransferase, an insulator-binding protein, and/or functional
fragments thereof. Polynucleotides encoding any of the zinc finger proteins, components
thereof, functional fragments thereof, and fusions thereof are also provided.

In additional embodiments, methods for modulating gene expression in plant cells, 
using modified plant ZFPs are provided. Because naturally-occurring plant ZFPs, which 
modulate plant gene expression in vivo, do not contain zinc fingers in tandem arrays, the
ability of a ZFP containing a tandem array of zinc fingers to modulate gene expression in a
plant cell is a surprising discovery. Thus, the compositions and methods disclosed herein
allow the insights gained from work with non-plant ZFPs such as zif268 and SP-1 to be
applied to regulation of plant gene expression by plant proteins, so that targeted regulation of
gene expression in plant cells can be achieved by mechanisms similar to those already
described for animal cells. In addition, by allowing targeted regulation of plant gene
expression by plant proteins, the present methods and compositions will help to allay potential
concerns regarding the introduction of animal proteins into plants.

The practice of the disclosed methods employs, unless otherwise indicated,
conventional techniques in molecular biology, biochemistry, genetics, computational
chemistry, cell culture, recombinant DNA and related fields as are within the skill of the art.
These techniques are fully explained in the literature. See, for example, Sambrook et al.
MOLECULAR CLONING: A LABORATORY MANUAL, Second edition, Cold Spring Harbor
Laboratory Press, 1989; Ausubel et al., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY,
John Wiley & Sons, New York, 1987 and periodic updates; and the series METHODS IN
ENZYMOLOGY, Academic Press, San Diego. 

Definitions 

The terms "nucleic acid," "polynucleotide," and "oligonucleotide" are used
interchangeably and refer to a deoxyribonucleotide or ribonucleotide polymer in either single-
or double-stranded form. For the purposes of the present disclosure, these terms are not to be
construed as limiting with respect to the length of a polymer. The terms can encompass
known analogues of natural nucleotides, as well as nucleotides that are modified in the base,
sugar and/or phosphate moieties. In general, an analogue of a particular nucleotide has the
same base-pairing specificity; i.e., an analogue of A will base-pair with T.

The terms "polypeptide," "peptide" and "protein" are used interchangeably to refer to
a polymer of amino acid residues. The term also applies to amino acid polymers in which one
or more amino acids are chemical analogues or modified derivatives of a corresponding
naturally occurring amino acid, for example selenocysteine (Bock et al. (1991) Trends
Biochem. Sci. 16:463-467; Nasim et al. (2000) J. Biol. Chem. 275:14,846-14,852) and the
like.

A "binding protein" is a protein that is able to bind non-covalently to another
molecule. A binding protein can bind to, for example, a DNA molecule (a DNA-binding
protein), an RNA molecule (an RNA-binding protein) and/or a protein molecule (a
protein-binding protein). In the case of a protein-binding protein, it can bind to itself (to form
homodimers, homotrimers, etc.) and/or it can bind to one or more molecules of a different
protein or proteins. A binding protein can have more than one type of binding activity. For
example, zinc finger proteins have DNA-binding, RNA-binding and protein-binding activity.
A "binding profile" refers to a plurality of target sequences that are recognized and bound by a
particular binding protein. For example, a binding profile can be determined by contacting a
binding protein with a population of randomized target sequences to identify a sub-population
of target sequences bound by that particular binding protein.

A "zinc finger binding protein" is a protein or segment within a larger protein that
binds DNA, RNA and/or protein in a sequence-specific manner as a result of stabilization of
protein structure through coordination of a zinc ion. The term zinc finger binding protein is
often abbreviated as zinc finger protein or ZFP.

A zinc finger "backbone" is the portion of a zinc finger outside the region involved in
DNA major groove interactions; i.e., the regions of the zinc finger outside of residues -1
through +6 of the alpha helix. The backbone comprises the beta strands, the connecting
region between the second beta strand and the alpha helix, the portion of the alpha helix distal
to the first conserved histidine residue, and the inter-finger linker sequence(s). Thus, a plant
zinc finger "backbone" refers to sequences derived from one or more plant ZFPs, where these
sequences are not naturally involved in DNA major groove interactions.
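
The split between backbone and recognition residues can be sketched as follows. The sketch assumes the common numbering convention in which the first conserved histidine is position +7, so that the seven residues immediately preceding it are positions -1 and +1 through +6; that convention, the function names, and both sequences used are assumptions made here for illustration.

```python
import re

# Groups: (backbone up to the helix)(positions -1..+6)(first His onward).
# ASSUMPTION: the first conserved histidine is position +7, so the seven
# residues immediately before it are positions -1, +1, ..., +6.
FINGER = re.compile(r"(C.{2,4}C.{5})(.{7})(H.{3,5}H)")
POSITIONS = (-1, 1, 2, 3, 4, 5, 6)


def recognition_residues(finger):
    """Return {position: residue} for positions -1 through +6."""
    m = FINGER.search(finger)
    if m is None:
        raise ValueError("no C2H2 finger core found")
    return dict(zip(POSITIONS, m.group(2)))


def graft_helix(finger, adapted_helix):
    """Keep the backbone and replace positions -1..+6 with `adapted_helix`."""
    if len(adapted_helix) != len(POSITIONS):
        raise ValueError("adapted helix must supply exactly seven residues")
    m = FINGER.search(finger)
    if m is None:
        raise ValueError("no C2H2 finger core found")
    return finger[:m.start(2)] + adapted_helix + finger[m.end(2):]


# Invented finger and invented replacement residues, for illustration only.
backbone_donor = "YKCPECGKSFSQKSNLVRHQRTHTG"
print(recognition_residues(backbone_donor))
print(graft_helix(backbone_donor, "RSDELTR"))
```

Keeping the graft as a pure string operation on positions -1 through +6 mirrors the idea that adapted residues are swapped in while every backbone residue is left as found in the donor.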

As used herein, the term "modified plant" zinc finger protein refers to a zinc finger
protein comprising plant ZFP sequences organized in a non-plant ZFP structure, for example
to eliminate the long stretches of amino acid sequence between zinc fingers found in many
naturally-occurring plant ZFPs. Thus, all, most or some of the sequences in the zinc finger
regions of a modified plant ZFP may be derived from a plant. Additionally, modified plant
ZFPs in these non-plant structures can further include one or more residues or regions (e.g.,
fingers) of non-plant origin, such as, for example, naturally-occurring fingers or fingers as
may be obtained by design or selection, so long as DNA binding capability is maintained.

A "non-canonical" zinc finger protein is a protein not occurring in nature that has been
designed and/or selected so as to differ from the canonical binding domain consensus
sequence Cys-Cys-His-His (e.g., Cys2-His2). Thus, non-canonical zinc finger proteins
comprise a substitution, addition and/or deletion of at least one amino acid, compared to a
naturally occurring zinc finger protein. Non-limiting examples of non-canonical zinc fingers
include binding domains comprising Cys-Cys-His-Cys (e.g., C3H) sequences and the like.
(See also International Publication entitled "Modified Zinc Finger Proteins," filed even date
herewith, Attorney docket No. 8325-0025.40).






A "designed" zinc finger protein is a protein not occurring in nature whose structure
and composition results principally from rational criteria. Criteria for rational design include
application of substitution rules and computerized algorithms for processing information in a
database storing information of existing ZFP designs and binding data, for example as
described in co-owned PCT WO 00/42219. A "selected" zinc finger protein is a protein not
found in nature whose production results primarily from an empirical process such as phage
display, two-hybrid systems and/or interaction trap assays. See, e.g., US 5,789,538;
US 6,007,988; US 6,013,453; WO 95/19431; WO 96/06166; WO 98/54311 and Joung et al.
(2000) Proc. Natl. Acad. Sci. USA 97:7382-7387. Selection methods also include ribosome
display systems (e.g., PCT WO 00/27878) and mRNA-peptide fusion systems (e.g., US Patent
No. 6,207,446; PCT WO 00/47775). Amino acid sequences of polypeptides (e.g., zinc
fingers) obtained by selection or design are referred to as "adapted" amino acid sequences.
Designed and/or selected ZFPs are modified according to the methods and compositions
disclosed herein and may also be referred to as "engineered" ZFPs.

The term "naturally-occurring" is used to describe an object that can be found in 
nature, as distinct from being artificially produced by a human. For example, naturally
occurring plant ZFPs are characterized by long spacers of diverse lengths between adjacent 
zinc finger components. 

Nucleic acid or amino acid sequences are "operably linked" (or "operatively linked") 
when placed into a functional relationship with one another. For instance, a promoter or
enhancer is operably linked to a coding sequence if it regulates, or contributes to the
modulation of, the transcription of the coding sequence. Operably linked DNA sequences are
typically contiguous, and operably linked amino acid sequences are typically contiguous and
in the same reading frame. However, since enhancers generally function when separated from
the promoter by up to several kilobases or more and intronic sequences may be of variable
lengths, some polynucleotide elements may be operably linked but not contiguous. Similarly,
certain amino acid sequences that are non-contiguous in a primary polypeptide sequence may
nonetheless be operably linked due to, for example, folding of a polypeptide chain.

With respect to fusion polypeptides, the term "operatively linked" can refer to the fact
that each of the components performs the same function in linkage to the other component as
it would if it were not so linked. For example, with respect to a fusion polypeptide in which a
modified plant ZFP DNA-binding domain is fused to a functional domain (or functional
fragment thereof), the ZFP DNA-binding domain and the functional domain (or functional
fragment thereof) are in operative linkage if, in the fusion polypeptide, the modified plant ZFP
DNA-binding domain portion is able to bind its target site and/or its binding site, while the
functional domain (or functional fragment thereof) is able to modulate (e.g., activate or
repress) transcription.

"Specific binding" between, for example, a ZFP and a specific target site means a
binding affinity of at least 1 x 10^6 M^-1.

A "fusion molecule" is a molecule in which two or more subunit molecules are linked,
preferably covalently. The subunit molecules can be the same chemical type of molecule, or
can be different chemical types of molecules. Examples of the first type of fusion molecule
include, but are not limited to, fusion polypeptides (for example, a fusion between a modified
plant ZFP DNA-binding domain and a functional domain) and fusion nucleic acids (for
example, a nucleic acid encoding the fusion polypeptide described herein). Examples of the
second type of fusion molecule include, but are not limited to, a fusion between a
triplex-forming nucleic acid and a polypeptide, and a fusion between a minor groove binder and a
nucleic acid.

A "gene," for the purposes of the present disclosure, includes a DNA region encoding 
a gene product (see below), as well as all DNA regions that regulate the production of the
gene product, whether or not such regulatory sequences are adjacent to coding and/or
transcribed sequences. Accordingly, a gene includes, but is not necessarily limited to,
promoter sequences, terminators, translational regulatory sequences such as ribosome binding
sites and internal ribosome entry sites, enhancers, silencers, insulators, boundary elements,
replication origins, matrix attachment sites and locus control regions. Further, a promoter can
be a normal cellular promoter or, for example, a promoter of an infecting microorganism such
as, for example, a bacterium or a virus.

"Gene expression" refers to the conversion of the information, contained in a gene,
into a gene product. A gene product can be the direct transcriptional product of a gene (e.g.,
mRNA, tRNA, rRNA, antisense RNA, ribozyme, structural RNA or any other type of RNA)
or a protein produced by translation of an mRNA. Gene products also include RNAs which
are modified, by processes such as capping, polyadenylation, methylation, and editing, and
proteins modified by, for example, methylation, acetylation, phosphorylation, ubiquitination,
ADP-ribosylation, myristilation, and glycosylation.

"Gene activation" and "augmentation of gene expression" refer to any process that
results in an increase in production of a gene product. A gene product can be either RNA
(including, but not limited to, mRNA, rRNA, tRNA, and structural RNA) or protein.
Accordingly, gene activation includes those processes that increase transcription of a gene
and/or translation of an mRNA. Examples of gene activation processes which increase
transcription include, but are not limited to, those which facilitate formation of a transcription
initiation complex, those which increase transcription initiation rate, those which increase
transcription elongation rate, those which increase processivity of transcription and those
which relieve transcriptional repression (by, for example, blocking the binding of a
transcriptional repressor). Gene activation can constitute, for example, inhibition of
repression as well as stimulation of expression above an existing level. Examples of gene
activation processes that increase translation include those that increase translational
initiation, those that increase translational elongation and those that increase mRNA stability.
In general, gene activation comprises any detectable increase in the production of a gene
product, preferably an increase in production of a gene product by about 2-fold, more
preferably from about 2- to about 5-fold or any integral value therebetween, more preferably
between about 5- and about 10-fold or any integral value therebetween, more preferably
between about 10- and about 20-fold or any integral value therebetween, still more preferably
between about 20- and about 50-fold or any integral value therebetween, more preferably
between about 50- and about 100-fold or any integral value therebetween, more preferably
100-fold or more.
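
The graded fold-increase ranges listed above can be illustrated with a short sketch that computes a fold change and reports the range it falls in. The function name and labels are hypothetical, the numbers are invented, and treating each range as including its lower bound and excluding its upper bound is an assumption made here, since the text uses "about" at every boundary.

```python
import bisect

# Lower bounds of the ranges named in the text, in ascending order.
TIER_BOUNDS = [2, 5, 10, 20, 50, 100]
TIER_LABELS = [
    "detectable, below 2-fold",
    "2- to 5-fold",
    "5- to 10-fold",
    "10- to 20-fold",
    "20- to 50-fold",
    "50- to 100-fold",
    "100-fold or more",
]


def activation_tier(baseline, observed):
    """Return (fold_change, label) for an increase over a positive baseline."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    fold = observed / baseline
    if fold <= 1:
        return fold, "no increase"
    # bisect_right counts how many lower bounds are <= fold.
    return fold, TIER_LABELS[bisect.bisect_right(TIER_BOUNDS, fold)]


# Hypothetical gene product levels in arbitrary units.
print(activation_tier(4.0, 30.0))  # (7.5, '5- to 10-fold')
```

The same arithmetic applies to repression if the ratio is inverted, that is, baseline divided by observed.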

"Gene repression" and 'inhibition of gene expression" refer to any process that results 
in a decrease in production of a gene product. A gene product can be either RNA (including,
but not limited to, mRNA, rRNA, tRNA, and structural RNA) or protein. Accordingly, gene
repression includes those processes that decrease transcription of a gene and/or translation of
an mRNA. Examples of gene repression processes which decrease transcription include, but
are not limited to, those which inhibit formation of a transcription initiation complex, those
which decrease transcription initiation rate, those which decrease transcription elongation rate,
those which decrease processivity of transcription and those which antagonize transcriptional
activation (by, for example, blocking the binding of a transcriptional activator). Gene
repression can constitute, for example, prevention of activation as well as inhibition of
expression below an existing level. Examples of gene repression processes that decrease
translation include those that decrease translational initiation, those that decrease translational
elongation and those that decrease mRNA stability. Transcriptional repression includes both
reversible and irreversible inactivation of gene transcription. In general, gene repression
comprises any detectable decrease in the production of a gene product, preferably a decrease
in production of a gene product by about 2-fold, more preferably from about 2- to about 5-fold
or any integral value therebetween, more preferably between about 5- and about 10-fold or
any integral value therebetween, more preferably between about 10- and about 20-fold or any
integral value therebetween, still more preferably between about 20- and about 50-fold or any
integral value therebetween, more preferably between about 50- and about 100-fold or any
integral value therebetween, more preferably 100-fold or more. Most preferably, gene
repression results in complete inhibition of gene expression, such that no gene product is 
detectable. 

The term "modulate" refers to a change in the quantity, degree or extent of a function. 
For example, the modified plant zinc finger-nucleotide binding polypeptides disclosed herein 
can modulate the activity of a promoter sequence by binding to a motif within the promoter, 
thereby inducing, enhancing or suppressing transcription of a gene operatively linked to the 
promoter sequence. Alternatively, modulation may include inhibition of transcription of a 
gene wherein the modified zinc finger-nucleotide binding polypeptide binds to the structural 
gene and blocks DNA dependent RNA polymerase from reading through the gene, thus 
inhibiting transcription of the gene. The structural gene may be a normal cellular gene or an 
oncogene, for example. Alternatively, modulation may include inhibition of translation of a 
transcript. Thus, "modulation" of gene expression includes both gene activation and gene 
repression. 

Modulation can be assayed by determining any parameter that is indirectly or directly 
affected by the expression of the target gene. Such parameters include, e.g., changes in RNA 
or protein levels; changes in protein activity; changes in product levels; changes in 
downstream gene expression; changes in transcription or activity of reporter genes such as, for 
example, luciferase, CAT, beta-galactosidase, or GFP (see, e.g., Misteli & Spector, (1997) 
Nature Biotechnology 15:961-964); changes in signal transduction; changes in 
phosphorylation and dephosphorylation; changes in receptor-ligand interactions; changes in 
concentrations of second messengers such as, for example, cGMP, cAMP, IP3, and Ca2+; 
changes in cell growth, changes in chemical composition (e.g., nutritional value), and/or 
changes in any functional effect of gene expression. Measurements can be made in vitro, in 
vivo, and/or ex vivo. Such functional effects can be measured by conventional methods, e.g., 
measurement of RNA or protein levels, measurement of RNA stability, and/or identification 
of downstream or reporter gene expression. Readout can be by way of, for example, 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, ligand binding assays; changes in intracellular second messengers such as cGMP and 
inositol triphosphate (IP3); changes in intracellular calcium levels; cytokine release, and the 
like. 

"Eucaryotic cells" include, but are not limited to, fungal cells (such as yeast), plant 
cells, animal cells, mammalian cells and human cells. Similarly, "prokaryotic cells" include, 
but are not limited to, bacteria. 

A "regulatory domain" or "functional domain" refers to a protein or a polypeptide 
sequence that has transcriptional modulation activity, or that is capable of interacting with 
proteins and/or protein domains that have transcriptional modulation activity. Typically, a 
functional domain is covalently or non-covalently linked to a ZFP to modulate transcription of 
a gene of interest. Alternatively, a ZFP can act, in the absence of a functional domain, to 
modulate transcription. Furthermore, transcription of a gene of interest can be modulated by a 
ZFP linked to multiple functional domains. 

A "functional fragment" of a protein, polypeptide or nucleic acid is a protein, 
polypeptide or nucleic acid whose sequence is not identical to the full-length protein, 
polypeptide or nucleic acid, yet retains the same function as the full-length protein, 
polypeptide or nucleic acid. A functional fragment can possess more, fewer, or the same 
number of residues as the corresponding native molecule, and/or can contain one or more 
amino acid or nucleotide substitutions. Methods for determining the function of a nucleic acid 
(e.g., coding function, ability to hybridize to another nucleic acid) are well known in the art. 
Similarly, methods for determining protein function are well known. For example, the 
DNA-binding function of a polypeptide can be determined, for example, by filter-binding, 
electrophoretic mobility-shift, or immunoprecipitation assays. See Ausubel et al., supra. The 
ability of a protein to interact with another protein can be determined, for example, by 
co-immunoprecipitation, two-hybrid assays or complementation, both genetic and biochemical. 
See, for example, Fields et al. (1989) Nature 340:245-246; U.S. Patent No. 5,585,245 and 
PCT WO 98/44350. 

A "target site" or "target sequence" is a sequence that is bound by a binding protein 
such as, for example, a ZFP. Target sequences can be nucleotide sequences (either DNA or 
RNA) or amino acid sequences. By way of example, a DNA target sequence for a 
three-finger ZFP is generally either 9 or 10 nucleotides in length, depending upon the presence 
and/or nature of cross-strand interactions between the ZFP and the target sequence. Target 
sequences can be found in any DNA or RNA sequence, including regulatory sequences, 
exons, introns, or any non-coding sequence. 

A "target subsite" or "subsite" is the portion of a DNA target site that is bound by a 
single zinc finger, excluding cross-strand interactions. Thus, in the absence of cross-strand 
interactions, a subsite is generally three nucleotides in length. In cases in which a cross-strand 
interaction occurs (e.g., a "D-able subsite," as described for example in co-owned PCT WO 
00/42219) a subsite is four nucleotides in length and overlaps with another 3- or 4-nucleotide 
subsite. 

The term "effective amount" includes that amount which results in the desired result, 
for example, deactivation of a previously activated gene, activation of a previously repressed 
gene, or inhibition of transcription of a structural gene or translation of RNA. 

Zinc Finger Proteins 

Zinc finger proteins are polypeptides that comprise zinc finger components. For 
example, zinc finger proteins can have one to thirty-seven fingers, commonly having 2, 3, 4, 5 
or 6 fingers. Zinc finger DNA-binding proteins are described, for example, in Miller et al. 
(1985) EMBO J. 4:1609-1614; Rhodes et al. (1993) Scientific American Feb.:56-65; and 
Klug (1999) J. Mol. Biol. 293:215-218. A zinc finger protein recognizes and binds to a target 
site (sometimes referred to as a target sequence or target segment) that represents a relatively 
small portion of sequence within a target gene. Each component finger of a zinc finger 
protein binds to a subsite within the target site. The subsite includes a triplet of three 
contiguous bases on the same strand (sometimes referred to as the target strand). The three 
bases in the subsite can be individually denoted the 5' base, the mid base, and the 3' base of 
the triplet, respectively. The subsite may or may not also include a fourth base on the 
non-target strand that is the complement of the base immediately 3' of the three contiguous bases 
on the target strand. The base immediately 3' of the three contiguous bases on the target 
strand is sometimes referred to as the 3' of the 3' base. Alternatively, the four bases of the 
target strand in a four base subsite can be numbered 4, 3, 2, and 1, respectively, starting from 
the 5' base. 
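
The subsite layout described above can be sketched as follows. This is a minimal illustration, not a prescribed method: the function name and the choice to report the fourth position as the target-strand base (whose complement lies on the non-target strand) are assumptions of the sketch.

```python
def subsites(target: str, fourth_base: bool = False) -> list[str]:
    """Split a target strand (written 5'->3') into per-finger subsites.

    Each finger's triplet is three contiguous bases. With fourth_base=True the
    base immediately 3' of each triplet is appended (reported here as the
    target-strand base), so adjacent four-base subsites overlap by one base.
    """
    target = target.upper()
    n_fingers = len(target) // 3
    expected = 3 * n_fingers + (1 if fourth_base else 0)
    if n_fingers == 0 or len(target) != expected:
        raise ValueError("target length does not fit the requested subsite layout")
    width = 4 if fourth_base else 3
    return [target[i:i + width] for i in range(0, 3 * n_fingers, 3)]
```

Under these assumptions a 9-nucleotide site yields three triplets, while a 10-nucleotide site read with `fourth_base=True` yields three overlapping four-base subsites.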

Zinc finger proteins have been identified in a variety of species. While plant ZFPs are 
characterized by long spacers between fingers, non-plant ZFPs have much shorter linkers 
between finger regions. An exemplary non-plant ZFP is the human transcription factor, Sp-1. 
As described in detail in WO 00/42219, each of the three zinc fingers in Sp-1 is approximately 
30 amino acids in length and is made up of a beta turn (approximately 12 residues in length), 
an alpha helix (approximately 10-12 residues in length), a short sequence connecting 
the beta turn and the alpha helix of approximately 2 residues, and an inter-finger 
linker sequence of 4-5 residues. Exemplary sequences of the zinc fingers of Sp-1 are shown 
in co-owned WO 00/42219. Also disclosed in WO 00/42219 is an Sp-1 consensus sequence, 
as described by Berg (1992) Proc. Natl. Acad. Sci. USA 89:11,109-11,110, which is useful in 
the design of targeted zinc finger proteins. 

Furthermore, in discussing the specificity-determining regions of a zinc finger, amino 
acid +1 refers to the first amino acid in the alpha-helical portion of the zinc finger. The 
portion of a zinc finger that is generally believed to be responsible for its binding specificity 
lies between -1 and +6. Amino acid ++2 refers to the amino acid at position +2 in a second 
zinc finger adjacent (in the C-terminal direction) to the zinc finger under consideration. In 
certain circumstances, a zinc finger binds to its triplet subsite substantially independently of 
other fingers in the same zinc finger protein. Accordingly, the binding specificity of a zinc 
finger protein containing multiple fingers is, to a first approximation, the aggregate of the 
specificities of its component fingers. For example, if a zinc finger protein is formed from 
first, second and third fingers that individually bind to triplets XXX, YYY, and ZZZ, the 
binding specificity of the zinc finger protein is 3'-XXX YYY ZZZ-5'. 
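
As an illustrative sketch of the -1 to +6 numbering, the following Python function locates the first invariant histidine of a C2H2 finger and labels the seven residues preceding it, taking the residue immediately N-terminal to that histidine as +6. The regular expression encodes the Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His arrangement given in the Background. The function name, the example sequence used in testing, and the assumption that numbering skips position 0 (so that -1 directly precedes +1) are not taken from the text.

```python
import re

# C2H2 arrangement: Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His; group 1 is the first His.
C2H2 = re.compile(r"C.{2,4}C.{12}(H).{3,5}H")

def recognition_residues(finger: str) -> dict[int, str]:
    """Map positions -1, +1 ... +6 to residues for a single C2H2 finger."""
    finger = finger.upper()
    match = C2H2.search(finger)
    if match is None:
        raise ValueError("no C2H2 motif found")
    first_his = match.start(1)
    positions = [-1, 1, 2, 3, 4, 5, 6]          # assumption: no position 0
    window = finger[first_his - 7:first_his]    # the residue just before His is +6
    return dict(zip(positions, window))
```

Residues -1, +2, +3 and +6 of the returned mapping are the positions most often altered when a new binding specificity is sought.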

The relative order of fingers in a zinc finger protein, from N-terminal to C-terminal, 
determines the relative order of triplets in the target sequence, in the 3' to 5' direction, that will 
be recognized by the fingers. For example, if a zinc finger protein comprises, from 
N-terminal to C-terminal, first, second and third fingers that individually bind to the triplets 
5'-GAC-3', 5'-GTA-3' and 5'-GGC-3', respectively, then the zinc finger protein binds to the 
target sequence 5'-GGCGTAGAC-3' (SEQ ID NO: 2). If the zinc finger protein comprises 
the fingers in another order, for example, second finger, first finger, third finger, then the zinc 
finger protein binds to a target segment comprising a different permutation of triplets, in this 
example, 5'-GGCGACGTA-3' (SEQ ID NO: 3). See Berg et al. (1996) Science 
271:1081-1086. The numbering convention used above is standard in the field for the region of a zinc 
finger conferring binding specificity. The amino acid on the N-terminal side of the first 
invariant His residue is assigned the number +6, and other amino acids, proceeding in an 
N-terminal direction, are assigned successively decreasing numbers. The alpha helix generally 
begins at residue +1 and extends to the residue following the second conserved histidine. The 
entire helix can therefore be of variable length, e.g., between 11 and 13 residues. 
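
The finger-order rule described above (the N-terminal finger contacts the 3'-most triplet) can be sketched in a few lines of Python. The function name is hypothetical; the two cases checked are the worked examples from the preceding paragraph.

```python
def target_site(triplets_n_to_c: list[str]) -> str:
    """Assemble the 5'->3' target sequence for fingers listed N- to C-terminal.

    Each triplet is written 5'->3'. Because the N-terminal finger binds the
    3'-most triplet, the triplets are joined in reverse finger order.
    """
    for triplet in triplets_n_to_c:
        if len(triplet) != 3 or set(triplet.upper()) - set("ACGT"):
            raise ValueError(f"not a DNA triplet: {triplet!r}")
    return "".join(t.upper() for t in reversed(triplets_n_to_c))

# Fingers 1, 2, 3 binding GAC, GTA, GGC give SEQ ID NO: 2.
assert target_site(["GAC", "GTA", "GGC"]) == "GGCGTAGAC"
# Reordering the fingers as 2, 1, 3 gives SEQ ID NO: 3.
assert target_site(["GTA", "GAC", "GGC"]) == "GGCGACGTA"
```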

A. Modified plant ZFPs 

A modified plant zinc finger protein is an amino acid sequence, or variant or fragment 
thereof, which is capable of binding to a target sequence and which comprises sequences 
derived from plant sources which have been reassembled in a non-plant ZFP structure. Thus, 
one or more of the following regions of a modified plant zinc finger are derived from one or 
more plant sources: the first beta strand, the second beta strand, the alpha helix, and the linker. 

It is to be understood that "non-plant" structure refers to any structure that deviates 
from typical naturally occurring plant ZFPs. One example of a non-plant ZFP scaffold 
suitable for providing a template for assembling plant-derived sequences is one in which the 
number of residues between the second histidine of one finger and the first cysteine of the 
adjacent, C-terminal finger is relatively short. In contrast to typical non-plant ZFPs, plant 
ZFPs are characterized by long spacers between adjacent fingers. Thus, in certain 
embodiments, a non-plant structure refers to ZFPs which contain tandem arrays of zinc 
fingers, i.e., structures in which there are between 5 and 50 amino acids between fingers, more 
preferably between 5 and 25 amino acids and even more preferably between 5 and 20 amino 
acids, or any integer therebetween. 
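
One way to examine the spacing criterion above is to count the residues between the second histidine of each finger and the first cysteine of the next. The sketch below does this with a regular expression for the C2H2 arrangement given in the Background. It is a heuristic under stated assumptions: the function names are hypothetical, fingers are taken as the leftmost non-overlapping motif matches, and non-canonical fingers are not detected.

```python
import re

# C2H2 arrangement: Cys-(X)2-4-Cys-(X)12-His-(X)3-5-His.
C2H2 = re.compile(r"C.{2,4}C.{12}H.{3,5}H")

def inter_finger_spacers(protein: str) -> list[int]:
    """Residues between the second His of one finger and the first Cys of the next."""
    fingers = list(C2H2.finditer(protein.upper()))
    return [nxt.start() - prev.end() for prev, nxt in zip(fingers, fingers[1:])]

def has_tandem_array_spacing(protein: str, lo: int = 5, hi: int = 50) -> bool:
    """True if every inter-finger spacer lies within lo..hi residues (5-50 by default)."""
    spacers = inter_finger_spacers(protein)
    return bool(spacers) and all(lo <= s <= hi for s in spacers)
```

The narrower preferred ranges (5 to 25, or 5 to 20 residues) can be checked by passing a different `hi` value.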

Thus, in certain embodiments, the modified plant ZFPs disclosed herein will not 
contain the sequence QALGGH (SEQ ID NO:105) in the recognition region, which is highly 
conserved in many plant ZFPs. See Takatsuji, (1999) Plant Mol. Biol. 39:1073-1078 and 
references cited therein. Yet another example of a non-plant ZFP structure is one that 
comprises both canonical C2H2 fingers and non-canonical (e.g., non-C2H2) fingers. (See, also 
International Publication entitled "Modified Zinc Finger Proteins" filed even date herewith, 
Attorney Docket No. 8325-0025.40). Other examples of non-plant structures can be readily 
determined by those of skill in the art in view of the teachings herein. Furthermore, it is to be 
understood that the modified plant ZFPs described herein may have one or more of these non- 
plant organization characteristics. 

Thus, although the modified plant ZFPs disclosed herein are composed wholly or 
partly of plant sequences, they have a non-plant structure. The non-plant structure of the 
modified plant ZFP can be similar to that of any class of non-plant ZFP, for instance the C2H2 
canonical class of ZFPs as exemplified by TFIIIA, Zif268 and Sp-1. Furthermore, the 
modified plant ZFP can comprise sequences from more than one class of ZFP, and selecting 
particular DNA binding residues and plant backbone residues to achieve the desired effector 
functions is within the ordinary skill in the art. The Sp-1 sequence used for construction of 
targeted zinc finger proteins corresponds to amino acids 531 to 624 in the Sp-1 transcription 
factor. Thus, models for design of modified plant ZFPs include, but are not limited to, Sp-1 
and an Sp-1 consensus sequence, described by Berg (1992) Proc. Natl. Acad. Sci. USA 
89:11,109-11,110 and by Shi et al. (1995) Chemistry and Biology 1:83-89. The amino acid 
sequences of these ZFP frameworks are disclosed in co-owned PCT WO 00/42219. Fungal 
ZFPs can also be used as models for design and/or as sources of zinc finger sequences for 
modified plant ZFPs. See, e.g., WO 96/32475. Other suitable ZFPs are known to those of 
skill in the art and are described herein. The documents cited herein also disclose methods of 
assessing binding specificity of modified ZFPs. 

Optionally, modified plant ZFPs can include one or more residues not present in a 
naturally occurring plant zinc finger, such as can be obtained by, for example, design and/or 
selection. For example, one or more sequences in the alpha-helical region, particularly 
residues involved in target recognition (e.g., amino acids -1, +2, +3 and +6), can be altered 
with respect to a naturally occurring plant ZFP. Any recognition sequence can be chosen, for 
example, by selecting residues known to bind to certain target sequences, determined as 
described herein and in the references cited herein. 

Sequences from any ZFP that is used in the methods described herein can be altered by 
mutagenesis, substitution, insertion and/or deletion of one or more residues so that the 
non-recognition plant-derived residues do not correspond exactly to the zinc finger from which 
they are derived. Preferably, at least 75% of the modified plant ZFP residues will correspond 
to those of the plant sequences, more often 90%, and most preferably greater than 95%. 
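
The percentage criterion above can be illustrated with a simple position-by-position comparison. This sketch assumes the modified sequence and the plant source sequence have already been aligned to equal length with no gaps, which is a simplification; the function names and tier labels are hypothetical.

```python
def fraction_plant_derived(modified: str, plant: str) -> float:
    """Fraction of positions at which the modified ZFP matches the plant sequence."""
    if not modified or len(modified) != len(plant):
        raise ValueError("sequences must be non-empty and pre-aligned to equal length")
    matches = sum(a == b for a, b in zip(modified.upper(), plant.upper()))
    return matches / len(modified)

def preference_tier(fraction: float) -> str:
    """Report which of the recited thresholds (75%, 90%, >95%) is met."""
    if fraction > 0.95:
        return "greater than 95%"
    if fraction >= 0.90:
        return "at least 90%"
    if fraction >= 0.75:
        return "at least 75%"
    return "below 75%"
```

For instance, altering two recognition residues in a 20-residue stretch leaves 18 of 20 positions (90%) corresponding to the plant sequence.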

In general, modified plant ZFPs are produced by a process of analysis of plant 
sequences, for example those sequences that are publicly available on any number of 
databases. Three-dimensional modeling can be used, but is not required. Typically, plant 
sequences are selected for their homology to non-plant ZFPs, for example, by selecting plant 
ZFPs that most closely resemble the chosen non-plant ZFP scaffold (e.g., a C3H structure 
and/or C2H2 ZFP structure such as Sp-1 or Sp-1 consensus) and binding mode. The plant 
sequences are then assembled in a non-plant binding mode structure, for instance as three zinc 
fingers separated by short linkers, as are present in non-plant ZFPs. Thus, the process of 
obtaining a modified plant ZFP with a predetermined binding specificity can begin by analysis 
of naturally occurring plant ZFPs. 

Once selected plant sequences have been organized and assembled to reflect a 
non-plant structure, alterations in the recognition residues (e.g., positions -1 to +6 of the alpha 
helix) can be made so as to confer a desired binding specificity, for example as described in 
co-owned WO 00/42219; WO 00/41566; as well as U.S. Patents 5,789,538; 6,007,408; 
6,013,453; 6,140,081 and 6,140,466; and PCT publications WO 95/19431, WO 98/54311, 
WO 00/23464; WO 00/27878; WO98/53057; WO98/53058; WO98/53059; and WO98/53060. 

In other embodiments, one or more residues, for example recognition residues, can be 
derived from non-plant sources and inserted into the modified plant ZFP structure. In 
particular, non-plant sequences that have previously been shown to bind to specific target 
sequences can be incorporated into the modified plant ZFP to provide the desired binding 
specificity. Thus, the modified plant ZFPs can include one or more non-plant derived 
residues involved in DNA binding where these binding residues have been designed and/or 
selected to recognize a particular target site, for example as described. 

In certain embodiments, modified plant ZFPs, as disclosed herein, contain additional 
modifications in their zinc fingers, for example, as described in applications of which the 
benefit is claimed herein. Such additional modifications include, for example, substitution of 
a zinc-coordinating amino acid residue (e.g., cysteine and/or histidine) with a different amino 
acid. A modified ZFP of this type can include any number of zinc finger components, and, in 
one embodiment, contains three zinc fingers. Typically, the C-terminal-most (e.g., third) 
finger of the ZFP is substituted in one or more zinc-coordinating residues. The other fingers 
of the protein can be naturally occurring zinc finger components, modified plant components, 
canonical C2H2 fingers or combinations of these components. 

Also included herein are nucleic acids encoding a ZFP comprising at least one 
modified plant zinc finger as described herein. 

B. Linkage 

Two or more zinc finger proteins can be linked to have a target site specificity that is, 
to a first approximation, the aggregate of that of the component zinc finger proteins. For 
example, a first modified plant zinc finger protein having first, second and third component 
fingers that respectively bind to XXX, YYY and ZZZ can be linked to a second modified 
plant zinc finger protein having first, second and third component fingers with binding 
specificities AAA, BBB and CCC. The binding specificity of the combined first and second 
proteins is thus 5'-CCCBBBAAANZZZYYYXXX-3' (SEQ ID NO:4), where N indicates a 
short intervening region (typically 0-5 bases of any type). In this situation, the target site can 
be viewed as comprising two target segments separated by an intervening segment. 
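
The composite target site described above can be expressed as a search pattern. The following Python sketch builds a regular expression for two linked three-finger proteins separated by an intervening region of 0-5 bases. The function name, parameter names and the triplets used in testing are hypothetical.

```python
import re

def composite_site_pattern(first_triplets: list[str],
                           second_triplets: list[str],
                           max_gap: int = 5) -> re.Pattern:
    """Regex for the 5'->3' target of two linked zinc finger proteins.

    Each list gives the triplets bound by fingers 1..3 (N- to C-terminal) of one
    protein, each triplet written 5'->3'. The second protein's segment lies 5' of
    the first protein's segment, separated by 0..max_gap bases of any type.
    """
    upstream = "".join(reversed(second_triplets)).upper()    # CCC BBB AAA
    downstream = "".join(reversed(first_triplets)).upper()   # ZZZ YYY XXX
    return re.compile(f"{upstream}[ACGT]{{0,{max_gap}}}{downstream}")
```

Applying `pattern.finditer` to a promoter sequence would then list candidate composite sites under these assumptions.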

Linkage of zinc fingers and zinc finger proteins can be accomplished using any of the 
following peptide linkers: 

TGEKP (SEQ ID NO: 5) Liu et al. (1997) Proc. Natl. Acad. Sci. USA 94:5525-5530. 

(G4S)n (SEQ ID NO: 6) Kim et al. (1996) Proc. Natl. Acad. Sci. USA 93:1156-1160. 

GGRRGGGS (SEQ ID NO: 7) 

LRQRDGERP (SEQ ID NO: 8) 

LRQKDGGGSERP (SEQ ID NO: 9) 

LRQKD(G3S)2ERP (SEQ ID NO: 10). 
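
Linker shorthand such as LRQKD(G3S)2ERP can be expanded into a plain one-letter amino acid sequence, which is convenient when assembling a full protein sequence. The sketch below is illustrative only: the function name is hypothetical, and it assumes that a digit after a letter denotes a run of that residue and a digit after a parenthesized unit denotes repeats of the unit.

```python
import re

_REPEAT = re.compile(r"\(([A-Z0-9]+)\)(\d+)")   # parenthesized unit with count, e.g. (G3S)2
_RUN = re.compile(r"([A-Z])(\d+)")              # single residue with count, e.g. G3

def expand_linker(shorthand: str) -> str:
    """Expand linker shorthand into a one-letter amino acid sequence."""
    def expand_runs(unit: str) -> str:
        return _RUN.sub(lambda m: m.group(1) * int(m.group(2)), unit)

    expanded = _REPEAT.sub(
        lambda m: expand_runs(m.group(1)) * int(m.group(2)), shorthand.upper()
    )
    return expand_runs(expanded)
```

A linker with an unspecified repeat count cannot be expanded until a count is chosen; for example, a hypothetical input of `(G4S)3` expands to three GGGGS units.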

Alternatively, flexible linkers can be rationally designed using computer programs 
capable of modeling both DNA-binding sites and the peptides themselves, or by phage display 
methods. In a further variation, non-covalent linkage can be achieved by fusing two zinc 
finger proteins with domains promoting heterodimer formation of the two zinc finger proteins. 
For example, one zinc finger protein can be fused with fos and the other with jun (see Barbas 
et al., WO 95/19431). Alternatively, dimerization interfaces can be obtained by selection. 
See, for example, Wang et al. (1999) Proc. Natl. Acad. Sci. USA 96:9568-9573. 

C. Fusion Molecules 

The modified plant zinc finger proteins described herein can also be used in the 
design of fusion molecules that facilitate regulation of gene expression, particularly in 
plants. Thus, in certain embodiments, the compositions and methods disclosed herein 
involve fusions between at least one of the zinc finger proteins described herein (or 
functional fragments thereof) and one or more functional domains (or functional 
fragments thereof), or a polynucleotide encoding such a fusion. The presence of such a 
fusion molecule in a cell allows a functional domain to be brought into proximity with a 
sequence in a gene that is bound by the zinc finger portion of the fusion molecule. The 
transcriptional regulatory function of the functional domain is then able to act on the 
gene, by, for example, modulating expression of the gene. 

In certain embodiments, fusion proteins comprising a modified plant zinc finger 
DNA-binding domain and a functional domain are used for modulation of endogenous 
gene expression as described, for example, in co-owned PCT WO 00/41566. Modulation 
includes repression and activation of gene expression; the nature of the modulation 
generally depending on the type of functional domain present in the fusion protein. Any 
polypeptide sequence or domain capable of influencing gene expression (or functional 
fragment thereof) that can be fused to a DNA-binding domain, is suitable for use. 

An exemplary functional domain for fusing with a ZFP DNA-binding domain, to 
be used for repressing gene expression, is a KRAB repression domain from the human 
KOX-1 protein (see, e.g., Thiesen et al., New Biologist 2, 363-374 (1990); Margolin et 
al., Proc. Natl. Acad. Sci. USA 91, 4509-4513 (1994); Pengue et al., Nucl. Acids Res. 
22:2908-2914 (1994); Witzgall et al., Proc. Natl. Acad. Sci. USA 91, 4514-4518 (1994)). 
Another suitable repression domain is methyl binding domain protein 2B (MBD-2B) 
(see, also Hendrich et al. (1999) Mamm Genome 10:906-912 for description of MBD 
proteins). Another useful repression domain is that associated with the v-ErbA protein. 
See, for example, Damm, et al. (1989) Nature 339:593-597; Evans (1989) Int. J. Cancer 
Suppl. 4:26-28; Pain et al. (1990) New Biol. 2:284-294; Sap et al. (1989) Nature 
340:242-244; Zenke et al. (1988) Cell 52:107-119; and Zenke et al. (1990) Cell 
61:1035-1049. Additional exemplary repression domains include, but are not limited to, thyroid 
hormone receptor (TR), SID, MBD1, MBD2, MBD3, MBD4, MBD-like proteins, 
members of the DNMT family (e.g., DNMT1, DNMT3A, DNMT3B), Rb, MeCP1 and 
MeCP2. See, for example, Zhang et al. (2000) Ann Rev Physiol 62:439-466; Bird et al. 
(1999) Cell 99:451-454; Tyler et al. (1999) Cell 99:443-446; Knoepfler et al. (1999) 
Cell 99:447-450; and Robertson et al. (2000) Nature Genet. 25:338-342. Additional 
exemplary repression domains include, but are not limited to, ROM2 and AtHD2A. See, 
for example, Chern et al. (1996) Plant Cell 8:305-321; and Wu et al. (2000) Plant J. 
22:19-27. 

Suitable domains for achieving activation include the HSV VP16 activation 
domain (see, e.g., Hagmann et al., J. Virol. 71, 5952-5962 (1997)); nuclear hormone 
receptors (see, e.g., Torchia et al., Curr. Opin. Cell. Biol. 10:373-383 (1998)); the p65 
subunit of nuclear factor kappa B (Bitko & Barik, J. Virol. 72:5610-5618 (1998) and 
Doyle & Hunt, Neuroreport 8:2937-2942 (1997)); Liu et al., Cancer Gene Ther. 5:3-28 
(1998)), or artificial chimeric functional domains such as VP64 (Seipel et al., EMBO J. 
11, 4961-4968 (1992)). 

Additional exemplary activation domains include, but are not limited to, p300, 
CBP, PCAF, SRC1, PvALF, and ERF-2. See, for example, Robyr et al. (2000) Mol. 
Endocrinol. 14:329-347; Collingwood et al. (1999) J. Mol. Endocrinol. 23:255-275; 
Leo et al. (2000) Gene 245:1-11; Manteuffel-Cymborowska (1999) Acta Biochim. Pol. 
46:77-89; McKenna et al. (1999) J. Steroid Biochem. Mol. Biol. 69:3-12; Malik et al. 
(2000) Trends Biochem. Sci. 25:277-283; and Lemon et al. (1999) Curr. Opin. Genet. 
Dev. 9:499-504. Additional exemplary activation domains include, but are not limited to, 
OsGAI, HALF-1, C1, AP1, ARF-5, -6, -7, and -8, CPRF1, CPRF4, MYC-RP/GP, and 
TRAB1. See, for example, Ogawa et al. (2000) Gene 245:21-29; Okanami et al. (1996) 
Genes Cells 1:87-99; Goff et al. (1991) Genes Dev. 5:298-309; Cho et al. (1999) Plant 
Mol. Biol. 40:419-429; Ulmasov et al. (1999) Proc. Natl. Acad. Sci. USA 96:5844-5849; 
Sprenger-Haussels et al. (2000) Plant J. 22:1-8; Gong et al. (1999) Plant Mol. Biol. 
41:33-44; and Hobo et al. (1999) Proc. Natl. Acad. Sci. USA 96:15,348-15,353. 

Additional functional domains are disclosed, for example, in co-owned 
WO 00/41566. Further, insulator domains, chromatin remodeling proteins such as ISWI- 
containing domains and/or methyl binding domain proteins suitable for use in fusion 
molecules are described, for example, in co-owned International Publications WO 
01/83793 and PCT/US01/42377. 

In additional embodiments, targeted remodeling of chromatin, as disclosed, for 
example, in co-owned International Publication WO 01/83793, can be used to generate 
one or more sites in plant cell chromatin that are accessible to the binding of a functional 
domain/modified plant ZFP fusion molecule. 

Fusion molecules are constructed by methods of cloning and biochemical 
conjugation that are well known to those of skill in the art. Fusion molecules comprise a 
modified plant ZFP binding domain and, for example, a transcriptional activation 
domain, a transcriptional repression domain, a component of a chromatin remodeling 
complex, an insulator domain or a functional fragment of any of these domains. In 
certain embodiments, fusion molecules comprise a modified plant zinc finger protein and 
at least two functional domains (e.g., an insulator domain or a methyl binding protein 
domain and, additionally, a transcriptional activation or repression domain). Fusion 
molecules also optionally comprise a nuclear localization signal (such as, for example, 
that from the SV40 T-antigen or the maize Opaque-2 NLS) and an epitope tag (such as, 
for example, FLAG or hemagglutinin). Fusion proteins (and nucleic acids encoding 
them) are designed such that the translational reading frame is preserved among the 
components of the fusion. 
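
The reading-frame requirement stated above can be checked mechanically when coding segments are joined. The following Python sketch is illustrative only: the function name is hypothetical, any segment sequences supplied to it are the caller's own, and it checks just two things, namely that each segment is a whole number of codons and that no stop codon occurs in frame before the final codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def assemble_fusion(components: list[tuple[str, str]]) -> str:
    """Join coding segments (name, DNA) in order, checking the frame is preserved.

    Every segment must be a whole number of codons so that downstream segments
    stay in frame, and no stop codon may appear before the last codon.
    """
    orf = ""
    for name, dna in components:
        dna = dna.upper()
        if len(dna) % 3 != 0:
            raise ValueError(f"{name}: length {len(dna)} is not a multiple of 3")
        orf += dna
    codons = [orf[i:i + 3] for i in range(0, len(orf), 3)]
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("in-frame stop codon before the end of the fusion")
    return orf
```

A typical call would list, in order, segments encoding a nuclear localization signal, the zinc finger domain, a functional domain and an epitope tag.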

The fusion molecules disclosed herein comprise a modified plant zinc finger 
binding protein that binds to a target site. In certain embodiments, the target site is 
present in an accessible region of cellular chromatin. Accessible regions can be 
determined as described in co-owned International Publications WO 01/83751 and WO 
01/83732. If the target site is not present in an accessible region of cellular chromatin, 
one or more accessible regions can be generated as described in co-owned International 
Publication WO 01/83793. In additional embodiments, one or more modified plant zinc 
finger components of a fusion molecule are capable of binding to cellular chromatin 
regardless of whether its target site is in an accessible region or not. For example, a ZFP 
as disclosed herein can be capable of binding to linker DNA and/or to nucleosomal DNA. 
Examples of this type of "pioneer" DNA binding domain are found in certain steroid 
receptors and in hepatocyte nuclear factor 3 (HNF3). Cordingley et al. (1987) Cell 
48:261-270; Pina et al. (1990) Cell 60:719-731; and Cirillo et al. (1998) EMBO J. 
17:244-254. 

Methods of gene regulation using a functional domain, targeted to a specific 
sequence by virtue of a fused DNA binding domain, can achieve modulation of gene 
expression. Genes so modulated can be endogenous genes or exogenous genes. 
Modulation of gene expression can be in the form of repression (e.g., repressing 
expression of exogenous genes, for example, when the target gene resides in a 
pathological infecting microorganism, or repression of an endogenous gene of the 
subject, such as an oncogene or a viral receptor, that contributes to a disease state). As 
described herein, repression of a specific target gene can be achieved by using a fusion 
molecule comprising a modified plant zinc finger protein and a functional domain. 

Alternatively, modulation can be in the form of activation, if activation of a gene 
(e.g., a tumor suppressor gene or a transgene) can ameliorate a disease state. In this case, 
a cell is contacted with any of the fusion molecules described herein, wherein the 
modified zinc finger portion of the fusion molecule is specific for the target gene. The 
target gene can be an exogenous gene such as, for example, a transgene, or it can be an 
endogenous cellular gene residing in cellular chromatin. The functional domain (e.g., 
insulator domain, activation domain, etc.) enables increased and/or sustained expression 
of the target gene. 

For any such applications, the fusion molecule(s) and/or nucleic acids encoding 
one or more fusion molecules can be formulated with an acceptable carrier, to facilitate 
introduction into and/or expression in plant cells, as is known to those of skill in the art. 

Polynucleotide and Polypeptide Delivery 

The compositions described herein can be provided to the target cell in vitro or in 
vivo. In addition, the compositions can be provided as polypeptides, polynucleotides or 
combination thereof. 

A. Delivery of Polynucleotides 

In certain embodiments, the compositions are provided as one or more 
polynucleotides. Further, as noted above, a modified plant zinc finger protein-containing 
composition can be designed as a fusion between a polypeptide zinc finger and a 
functional domain that is encoded by a fusion nucleic acid. In both fusion and non-fusion 
cases, the nucleic acid can be cloned into intermediate vectors for transformation into 
prokaryotic or eukaryotic (e.g., plant) cells for replication and/or expression. 
Intermediate vectors for storage or manipulation of the nucleic acid or production of 
protein can be prokaryotic vectors (e.g., plasmids), shuttle vectors, insect vectors, or 
viral vectors, for example. A nucleic acid encoding a modified plant zinc finger protein 
can also be cloned into an expression vector, for administration to a bacterial cell, fungal 
cell, protozoal cell, plant cell, or animal cell, preferably a plant cell. 

To obtain expression of a cloned nucleic acid, it is typically subcloned into an 
expression vector that contains a promoter to direct transcription. Suitable bacterial and 
eukaryotic promoters are well known in the art and described, e.g., in Sambrook et al., 
supra; Ausubel et al., supra; and Kriegler, Gene Transfer and Expression: A 
Laboratory Manual (1990). Bacterial expression systems are available in, e.g., E. coli, 
Bacillus sp., and Salmonella. Palva et al. (1983) Gene 22:229-235. Kits for such 
expression systems are commercially available. Eukaryotic expression systems for 
mammalian cells, yeast, and insect cells are well known in the art and are also 
commercially available, for example, from Invitrogen, Carlsbad, CA and Clontech, Palo 
Alto, CA. 

Plant expression vectors and reporter genes are also generally known in the art. 
(See, Gruber et al. (1993) in Methods of Plant Molecular Biology and 
Biotechnology, CRC Press.) Such systems include in vitro and in vivo recombinant DNA 
techniques, and any other synthetic or natural recombination. (See, e.g., Transgenic 
Plants: A Production System for Industrial and Pharmaceutical Proteins, Owen and Pen 
eds., John Wiley & Sons, 1996; Transgenic Plants, Galun and Breiman eds., Imperial 
College Press, 1997; Applied Plant Biotechnology, Chopra, Malik, and Bhat eds., Science 
Publishers, Inc., 1999.) 

The promoter used to direct expression of the nucleic acid of choice depends on 
the particular application. For example, a strong constitutive promoter is typically used 
for expression and purification. In contrast, when a protein is to be used in vivo, either a 
constitutive or an inducible promoter is used, depending on the particular use of the 
protein. In addition, a weak promoter can be used when low but sustained levels of 
protein are required. The promoter typically can also include elements that are 
responsive to transactivation, e.g., hypoxia response elements and small molecule control 
systems such as tet-regulated systems and the RU-486 system. See, e.g., Gossen et al. 
(1992) Proc. Natl. Acad. Sci. USA 89:5547-5551; Oligino et al. (1998) Gene Ther. 5:491- 
496; Wang et al. (1997) Gene Ther. 4:432-441; Neering et al. (1996) Blood 88:1147- 
1155; and Rendahl et al. (1998) Nat. Biotechnol. 16:757-761. 

Promoters suitable for use in plant expression systems include, but are not limited 
to, viral promoters such as the 35S RNA and 19S RNA promoters of cauliflower mosaic 
virus (CaMV) (Brisson et al. (1984) Nature 310:511-514, Example 1); the coat protein 
promoter of TMV (Takamatsu et al. (1987) EMBO J. 6:307-311); plant promoters such 
as the small subunit of RUBISCO (Coruzzi et al. (1984) EMBO J. 3:1671-1680; Broglie 
et al. (1984) Science 224:838-843); and plant heat shock promoters, e.g., soybean hsp17.5-E 
or hsp17.3-B (Gurley et al. (1986) Mol. Cell. Biol. 6:559-565). Other examples 
of promoters that may be used in expression vectors comprising nucleotides encoding 
modified plant ZFPs include the promoter for the small subunit of ribulose-1,5-bis- 
phosphate carboxylase; promoters from tumor-inducing plasmids of Agrobacterium 
tumefaciens, such as the nopaline synthase (NOS) and octopine synthase 
promoters; bacterial T-DNA promoters such as mas and ocs promoters; or the figwort 
mosaic virus 35S promoter. 

In a preferred embodiment, the modified plant ZFP polynucleotide sequence is 
under the control of the cauliflower mosaic virus (CaMV) 35S promoter (Example 3). 
The caulimovirus family has provided a number of exemplary promoters for transgene 
expression in plants, in particular, the CaMV 35S promoter. (See, e.g., Kay et al. (1987) 
Science 236:1299.) Additional promoters from this family, such as the figwort mosaic 
virus promoter, the Commelina yellow mottle virus promoter, and the rice tungro 
bacilliform virus promoter, have been described in the art, and may also be used in the 
methods and compositions disclosed herein. (See, e.g., Sanger et al. (1990) Plant Mol. 
Biol. 14:433-443; Medberry et al. (1992) Plant Cell 4:185-192; Yin and Beachy (1995) 
Plant J. 7:969-980.) 

The promoters may be modified, if desired, to affect their control characteristics. 
For example, the CaMV 35S promoter may be ligated to the portion of the RUBISCO 
gene that represses the expression of RUBISCO in the absence of light, to create a 
promoter that is active in leaves, but not in roots. The resulting chimeric promoter may 
be used as described herein. Constitutive plant promoters such as actin and ubiquitin, 
having general expression properties known in the art, may be used to express modified 
plant ZFPs. (See, e.g., McElroy et al. (1990) Plant Cell 2:163-171; Christensen et al. 
(1992) Plant Mol. Biol. 18:675-689.) 

Additionally, depending on the desired tissue, expression may be targeted to the 
endosperm, aleurone layer, embryo (or its parts such as scutellum and cotyledons), pericarp, 
stem, leaves, tubers, roots, etc. Examples of known tissue-specific promoters include the 
tuber-directed class I patatin promoter, the promoters associated with potato tuber 
ADPGPP genes, the soybean promoter of β-conglycinin (7S protein) which drives seed- 
directed transcription, and seed-directed promoters from the zein genes of maize 
endosperm. (See, e.g., Bevan et al., 1986, Nucleic Acids Res. 14: 4625-38; Muller et al., 
1990, Mol. Gen. Genet. 224: 136-46; Bray, 1987, Planta 172: 364-370; Pedersen et al., 
1982, Cell 29: 1015-26.) Additional seed-specific promoters include the phaseolin and 
napin promoters. 

In addition to a promoter, an expression vector typically contains a transcription 
unit or expression cassette that contains additional elements required for the expression of 
the nucleic acid in host cells, either prokaryotic or eukaryotic. A typical expression 
cassette thus contains a promoter operably linked, e.g., to the nucleic acid sequence, and 
signals required, e.g., for efficient polyadenylation of the transcript, transcriptional 
termination, ribosome binding, and/or translation termination. Additional elements of the 
cassette may include, e.g., enhancers, and heterologous spliced intronic signals. 

The particular expression vector used to transport the genetic information into the 
cell is selected with regard to the intended use of the resulting ZFP polypeptide, e.g., 
expression in plants. 

In addition, the recombinant constructs may include plant-expressible selectable 
or screenable marker genes for isolating, identifying or tracking of plant cells 
transformed by these constructs. Selectable markers include, but are not limited to, genes 
that confer antibiotic resistances (e.g., resistance to kanamycin or hygromycin) or 
herbicide resistance (e.g., resistance to sulfonylurea, phosphinothricin, or glyphosate). 
Screenable markers include, but are not limited to, the genes encoding beta-glucuronidase 
(Jefferson (1987) Plant Molec. Biol. Rep. 5:387-405), luciferase (Ow et al. (1986) Science 
234:856-859), and the B and C1 gene products that regulate anthocyanin pigment 
production (Goff et al. (1990) EMBO J. 9:2517-2522). 

Other elements that are optionally included in expression vectors also include a 
replicon that functions in E. coli (or in the prokaryotic host, if other than E. coli), a 
selective marker that functions in a prokaryotic host, e.g., a gene encoding antibiotic 
resistance, to permit selection of bacteria that harbor recombinant plasmids, and unique 
restriction sites in nonessential regions of the vector to allow insertion of recombinant 
sequences. 

Standard transfection methods can be used to produce bacterial, mammalian, 
yeast, insect, other cell lines or, preferably, plants that express large quantities of 
modified plant zinc finger proteins, which can be purified, if desired, using standard 
techniques. See, e.g., Colley et al. (1989) J. Biol. Chem. 264:17619-17622; and Guide to 
Protein Purification, in Methods in Enzymology, vol. 182 (Deutscher, ed.) 1990. 
Transformation of non-plant eukaryotic cells and prokaryotic cells is performed 
according to standard techniques. See, e.g., Morrison (1977) J. Bacteriol. 132:349-351; 
Clark-Curtiss et al. (1983) in Methods in Enzymology 101:347-362 (Wu et al., eds). 

Transformation systems for plants are also known. (See, e.g., Weissbach & 
Weissbach, Methods for Plant Molecular Biology, Academic Press, NY, Section VIII, pp. 
421-463 (1988); Grierson & Corey, Plant Molecular Biology, 2d Ed., Blackie, London, 
Ch. 7-9 (1988).) For example, Agrobacterium is often successfully employed to 
introduce nucleic acids into plants. Such transformation preferably uses binary 
Agrobacterium T-DNA vectors which can be used to transform dicotyledonous plants, 
monocotyledonous plants and plant cells (Bevan (1984) Nuc. Acid Res. 12:8711-8721; 
Horsch et al. (1985) Science 227:1229-1231; Bevan et al. (1982) Ann. Rev. Genet. 16:357- 
384; Rogers et al. (1986) Methods Enzymol. 118:627-641; Hernalsteen et al. (1984) 
EMBO J. 3:3039-3041). In embodiments that utilize the Agrobacterium system for 
transforming plants, the recombinant DNA constructs typically comprise at least the right 
T-DNA border sequence flanking the DNA sequences to be transformed into the plant 
cell. In preferred embodiments, the sequences to be transferred are flanked by the right 
and left T-DNA border sequences. The design and construction of such T-DNA based 
transformation vectors are well known to those skilled in the art. 

Other gene transfer and transformation methods include, but are not limited to, 
protoplast transformation through calcium-, polyethylene glycol (PEG)- or 
electroporation-mediated uptake of naked DNA (see Paszkowski et al. (1984) EMBO J. 
3:2717-2722; Potrykus et al. (1985) Molec. Gen. Genet. 199:169-177; Fromm et al. 
(1985) Proc. Nat. Acad. Sci. USA 82:5824-5828; and Shimamoto (1989) Nature 338:274- 
276); electroporation of plant tissues (D'Halluin et al. (1992) Plant Cell 4:1495-1505); 
microinjection; silicon carbide mediated DNA uptake (Kaeppler et al. (1990) Plant Cell 
Reporter 9:415-418); microprojectile bombardment (see Klein et al. (1988) Proc. Nat. 
Acad. Sci. USA 85:4305-4309; and Gordon-Kamm et al. (1990) Plant Cell 2:603-618); 
direct gene transfer, in vitro protoplast transformation, plant virus-mediated 
transformation, liposome-mediated transformation, and ballistic particle acceleration. 
(See, e.g., Paszkowski et al. (1984) EMBO J. 3:2717-2722; U.S. Patent Nos. 4,684,611; 
4,407,956; 4,536,475; Crossway et al. (1986) Biotechniques 4:320-334; Riggs et al. 
(1986) Proc. Natl. Acad. Sci. USA 83:5602-5606; Hinchee et al. (1988) Biotechnology 
6:915-921; U.S. Patent No. 4,945,050.) 

A wide variety of host cells, plants and plant cell systems can be used, including, 
but not limited to, those monocotyledonous and dicotyledonous plants, such as crops 
including grain crops (e.g., wheat, maize, rice, millet, barley), fruit crops (e.g., tomato, 
apple, pear, strawberry, orange), forage crops (e.g., alfalfa), root vegetable crops (e.g., 
carrot, potato, sugar beets, yam), leafy vegetable crops (e.g., lettuce, spinach); flowering 
plants (e.g., petunia, rose, chrysanthemum), conifers and pine trees (e.g., pine, fir, spruce); 
plants used in phytoremediation (e.g., heavy metal accumulating plants); oil crops (e.g., 
sunflower, rape seed) and plants used for experimental purposes (e.g., Arabidopsis). 

Modified plant ZFPs and the resulting gene product the ZFP modulates can also 
be produced from seed by way of seed-based production techniques using, for example, 
canola, corn, soybeans, rice and barley seed, and the modified plant ZFP, and/or 
sequences encoding it, can be recovered during seed germination. See, e.g., PCT 
Publication Numbers WO 9940210; WO 9916890; WO 9907206; U.S. Patent No.: 
5,866,121; and U.S. Patent No.: 5,792,933; and all references cited therein. 

B. Delivery of Polypeptides 

In additional embodiments, modified plant ZFPs or fusion proteins comprising 
modified plant ZFPs are administered directly to target plant cells. In certain in vitro 
situations, the target cells are cultured in a medium containing a fusion protein 
comprising one or more functional domains fused to one or more of the modified plant 
ZFPs described herein. An important factor in the administration of polypeptide 
compounds in plants is ensuring that the polypeptide has the ability to traverse a cell wall. 
However, proteins, viruses, toxins, ballistic methods and the like have the ability to 
translocate polypeptides across a plant cell wall. 

For example, "plasmodesmata" is the term given to explain cell-to-cell transport 
of endogenous and viral proteins and ribonucleoprotein complexes (RNPCs) in plants. 
Examples of viruses which can be linked to a modified plant zinc finger polypeptide (or 
fusion containing the same) for facilitating its uptake into plant cells include tobacco 
mosaic virus (Oparka et al. (1997) Plant J. 12:781-789); rice phloem thioredoxin 
(Ishiwatari et al. (1998) Planta 205:12-22); potato virus X (Cruz et al. (1998) Plant Cell 
10:495-510) and the like. Other suitable chemical moieties that provide enhanced 
cellular uptake can also be linked, either covalently or non-covalently, to the ZFPs. 
Toxin molecules also have the ability to transport polypeptides across cell walls. 

Particle-mediated delivery techniques (e.g., ballistic injection) as described above 
regarding nucleic acids can also be used to introduce polypeptides into a plant cell. 

Applications 

The modified plant zinc finger proteins and fusion molecules disclosed herein, 
and expression vectors encoding these polypeptides, can be used to modulate the 
expression of, or the action of, any plant endogenous or exogenous gene or gene product. 
In such applications, modified plant ZFP-containing compositions can be administered 
directly to a plant, e.g., to facilitate the modulation of gene expression. Preferably, the 
modulated gene is endogenous, for example a gene involved in growth, development, 
morphology, seed or fruit-bearing ability and the like. The gene product itself may be 
isolated and, accordingly, modulation of endogenous plant genes can be achieved using 
plant-derived sequences. 

Accordingly, expression of any gene in any organism, for example plants or 
fungi, can be modulated using the methods and compositions disclosed herein, including 
therapeutically relevant genes, genes of infecting microorganisms, viral genes, and genes 
whose expression is modulated in the processes of drug discovery and/or target 
validation. Such genes include, but are not limited to, Wilms' third tumor gene (WT3), 
vascular endothelial growth factors (VEGFs), VEGF receptors (e.g., flt and flk), CCR-5, 
low density lipoprotein receptor (LDLR), estrogen receptor, HER-2/neu, BRCA-1, 
BRCA-2, phosphoenolpyruvate carboxykinase (PEPCK), CYP7, fibrinogen, 
apolipoprotein A (ApoA), apolipoprotein B (ApoB), renin, phosphoenolpyruvate 
carboxykinase (PEPCK), CYP7, fibrinogen, nuclear factor κB (NF-κB), inhibitor of NF- 
κB (I-κB), tumor necrosis factors (e.g., TNF-α, TNF-β), interleukin-1 (IL-1), FAS 
(CD95), FAS ligand (CD95L), atrial natriuretic factor, platelet-derived factor (PDF), 
amyloid precursor protein (APP), tyrosinase, tyrosine hydroxylase, β-aspartyl 
hydroxylase, alkaline phosphatase, calpains (e.g., CAPN10), neuronal pentraxin receptor, 
adriamycin response protein, apolipoprotein E (apoE), leptin, leptin receptor, UCP-1, 
IL-1, IL-1 receptor, IL-2, IL-3, IL-4, IL-5, IL-6, IL-12, IL-15, interleukin receptors, 
G-CSF, GM-CSF, colony stimulating factor, erythropoietin (EPO), platelet-derived 
growth factor (PDGF), PDGF receptor, fibroblast growth factor (FGF), FGF receptor, 
PAF, p16, p19, p53, Rb, p21, myc, myb, globin, dystrophin, eutrophin, cystic fibrosis 
transmembrane conductance regulator (CFTR), GNDF, nerve growth factor (NGF), NGF 
receptor, epidermal growth factor (EGF), EGF receptor, transforming growth factors 
(e.g., TGF-α, TGF-β), fibroblast growth factor (FGF), interferons (e.g., IFN-α, IFN-β 
and IFN-γ), insulin-related growth factor-1 (IGF-1), angiostatin, ICAM-1, signal 
transducer and activator of transcription (STAT), androgen receptors, e-cadherin, 
cathepsins (e.g., cathepsin W), topoisomerase, telomerase, bcl, bcl-2, Bax, T Cell-specific 
tyrosine kinase (Lck), p38 mitogen-activated protein kinase, protein tyrosine phosphatase 
(hPTP), adenylate cyclase, guanylate cyclase, α7 neuronal nicotinic acetylcholine 
receptor, 5-hydroxytryptamine (serotonin)-2A receptor, transcription elongation factor-3 
(TEF-3), phosphatidylcholine transferase, ftz, PTI-1, polygalacturonase, EPSP synthase, 
FAD2-1, Δ-9 desaturase, Δ-12 desaturase, Δ-15 desaturase, acetyl-Coenzyme A 
carboxylase, acyl-ACP thioesterase, ADP-glucose pyrophosphorylase, starch synthase, 
cellulose synthase, sucrose synthase, fatty acid hydroperoxide lyase, and peroxisome 
proliferator-activated receptors, such as PPAR-γ2. 

Expression of human, mammalian, bacterial, fungal, protozoal, Archaeal, plant 
and viral genes can be modulated; viral genes include, but are not limited to, hepatitis 
virus genes such as, for example, HBV-C, HBV-S, HBV-X and HBV-P; and HIV genes 
such as, for example, tat and rev. Modulation of expression of genes encoding antigens 
of a pathogenic organism can be achieved using the disclosed methods and compositions. 

In other embodiments, the modulated gene can be exogenous, for example, a 
transgene that has been inserted into the plant. Techniques for generating transgenic 
plants are known in the art (see, e.g., Swain W F (1991) TIBTECH 9: 107-109; Ma J K C 
et al. (1994) Eur J Immunology 24: 131-138; Hiatt A et al. (1992) FEBS Letters 307:71- 
75; Hein M B et al. (1991) Biotechnology Progress 7: 455-461; During K (1990) Plant 
Molecular Biology 15: 281-294). As with endogenous genes, the modified plant ZFP (or 
fusion polypeptides comprising the modified plant ZFPs described herein) can then 
modulate expression of a transgene, for example to produce a protein product of interest, 
without the need for regulatory molecules derived primarily from non-plant (e.g., animal) 
sources. 

Accordingly, the compositions and methods disclosed herein can be used to 
facilitate a number of processes involving transcriptional regulation in plants. These 
processes include, but are not limited to, transcription, replication, recombination, repair, 
integration, maintenance of telomeres, processes involved in chromosome stability and 
disjunction, and maintenance and propagation of chromatin structures. The methods and 
compositions disclosed herein can be used to affect any of these processes, as well as any 
other process that can be influenced by ZFPs or ZFP fusions. 

Additional exemplary applications for modulation of gene expression in plant 
cells using modified plant ZFPs include, for example, the optimization of crop traits 
affecting nutritional value, yield, stress tolerance, pathogen resistance, and resistance to 
agrochemicals (e.g., insecticides and/or herbicides). In addition, targeted gene regulation 
can be used to study gene function in plants, and to adapt plants for use as biological 
factories for the production of pharmaceutical compounds or industrial chemicals. 

In preferred embodiments, one or more of the molecules described herein are used 
to achieve targeted activation or repression of gene expression, e.g., based upon the target 
site specificity of the modified plant ZFP. In another embodiment, one or more of the 
molecules described herein are used to achieve reactivation of a gene, for example a 
developmentally silenced gene; or to achieve sustained activation of a transgene. A 
modified plant ZFP can be targeted to a region outside of the coding region of the gene of 
interest and, in certain embodiments, is targeted to a region outside of known regulatory 
region(s) of the gene. In these embodiments, additional molecules, exogenous and/or 
endogenous, can optionally be used to facilitate repression or activation of gene 
expression. The additional molecules can also be fusion molecules, for example, fusions 
between a ZFP and a functional domain such as an activation or repression domain. See, 
for example, co-owned WO 00/41566. 

In other applications, modified plant ZFPs and other DNA- and/or RNA-binding 
proteins are used in diagnostic methods for sequence-specific detection of target nucleic acid 
in a sample. For example, modified plant ZFPs can be used to detect variant alleles associated 
with a phenotype in a plant. As an example, modified plant ZFPs can be used to detect the 
presence of particular mRNA species or cDNA in a complex mixture of mRNAs or cDNAs. 
As a further example, modified plant ZFPs can be used to quantify the copy number of a gene 
in a sample. A suitable format for performing diagnostic assays employs modified plant ZFPs 
linked to a domain that allows immobilization of the ZFP on a solid support such as, for 
example, a microtiter plate or an ELISA plate. The immobilized ZFP is contacted with a 
sample suspected of containing a target nucleic acid under conditions in which binding 
between the modified ZFP and its target sequence can occur. Typically, nucleic acids in the 
sample are labeled (e.g., in the course of PCR amplification). Alternatively, unlabelled 
nucleic acids can be detected using a second labeled probe nucleic acid. After washing, 
bound, labeled nucleic acids are detected. Labeling can be direct (i.e., the probe binds directly 
to the target nucleic acid) or indirect (i.e., the probe binds to one or more molecules which 
themselves bind to the target). Labels can be, for example, radioactive, fluorescent, 
chemiluminescent and/or enzymatic. 

Modified plant ZFPs, as disclosed herein, can also be used in assays that link 
phenotype to the expression of particular genes. Current methodologies for determination of 
gene function rely primarily upon either over-expressing a gene of interest or removing a gene 
of interest from its natural biological setting, and observing the effects. The phenotypic 
effects resulting from over-expression or knockout are then interpreted as an indication of the 
role of the gene in the biological system. Up- or down-regulation of gene expression using 
one or more modified plant ZFPs obviates the necessity of generating transgenic plants for use 
in these types of assay. 

The following examples are presented as illustrative of, but not limiting, the 
claimed subject matter. 

EXAMPLES 

Example 1. Production of modified plant zinc finger binding proteins 

This example describes a strategy to select amino acid sequences for plant zinc finger 
backbones from among existing plant zinc finger sequences, and subsequent conceptual 
modification of the selected plant zinc finger amino acid sequences to optimize their DNA 
binding ability. Oligonucleotides used in the preparation of polynucleotides encoding 
proteins containing these zinc fingers in tandem array are then described. 

A. Selection of plant zinc finger backbones 

A search was conducted for plant zinc fingers whose backbone sequences (i.e., the 
portion of the zinc finger outside of the -1 through +6 portion of the recognition helix) 
resembled that of the SP-1 consensus sequence described by Berg (1992) Proc. Natl. Acad. 
Sci. USA 89:11,109-11,110. The sequences selected included the two conserved cysteine 
residues, a conserved basic residue (lysine or arginine) located two residues to the C-terminal 
side of the second (i.e., C-terminal) cysteine, a conserved phenylalanine residue located two 
residues to the C-terminal side of the basic residue, the two conserved histidine residues, and a 
conserved arginine residue located two residues to the C-terminal side of the first (i.e., N- 
terminal) conserved histidine. The amino acid sequences of these selected plant zinc finger 
backbones (compared to the SP-1 consensus sequence) are shown below, with conserved 
residues shown in bold and X referring to residues located at positions -1 through +6 in the 
recognition helix (which will differ among different proteins depending upon the target 
sequence): 

SP-1 consensus:       YKCPECGKSFSXXXXXXXHQRTHTGEKP (SEQ ID NO: 11) 
F1:             KKKSKGHECPICFRVFKXXXXXXXHKRSHTGEKP (SEQ ID NO: 12) 
F2:                   YKCTVCGKSFSXXXXXXXHKRLHTGEKP (SEQ ID NO: 13) 
F3:                   FSCNYCQRKFYXXXXXXXHVRIH (SEQ ID NO: 14) 
                            -5  -1    5 

The first finger (F1) was chosen because it contained a basic sequence N-terminal to the 
finger that is also found adjacent to the first finger of SP-1. The finger denoted F1 is a Petunia 
sequence; the F2 and F3 fingers are Arabidopsis sequences. 
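
The selection criteria above can be expressed as a pattern check. The following Python sketch is illustrative only: the regular expression, the function name has_conserved_residues and the Cys-(X)2-4-Cys spacing (taken from the C2H2 motif given in the Background) are assumptions of this sketch, and the backbone strings are the sequences of SEQ ID NOs: 11-14 as read from the listing above.

```python
import re

# Conserved positions named in the text, written as a regular expression.
# "." matches any residue; X{7} is the recognition helix (-1 through +6).
BACKBONE = re.compile(
    r"C.{2,4}C"   # the two conserved cysteines
    r".[KR]"      # basic residue two positions after the second cysteine
    r".F"         # phenylalanine two positions after the basic residue
    r".X{7}"      # one backbone residue, then the seven helix positions
    r"H.R"        # first histidine, with arginine two positions after it
    r".H"         # second conserved histidine
)

BACKBONES = {
    "SP-1 consensus": "YKCPECGKSFSXXXXXXXHQRTHTGEKP",
    "F1": "KKKSKGHECPICFRVFKXXXXXXXHKRSHTGEKP",
    "F2": "YKCTVCGKSFSXXXXXXXHKRLHTGEKP",
    "F3": "FSCNYCQRKFYXXXXXXXHVRIH",
}


def has_conserved_residues(seq: str) -> bool:
    """Return True if seq carries every conserved residue at its stated offset."""
    return BACKBONE.search(seq) is not None


for name, seq in BACKBONES.items():
    print(name, has_conserved_residues(seq))
```

All four backbones satisfy the pattern; a sequence lacking, for example, the phenylalanine two residues after the basic residue does not.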

B. Modification of plant zinc finger backbones 

Two of the three plant zinc fingers (F1 and F3, above) were modified so that their amino 
acid sequences more closely resembled the sequence of SP-1, as follows. (Note that the 
sequence of SP-1 is different from the sequence denoted "SP-1 consensus.") In F3, the Y residue 
at position -2 was converted to a G, and the sequence QNKK (SEQ ID NO:15) was added to the 
C-terminus of F3. The QNKK sequence is present C-terminal to the third finger of SP-1, and 
permits greater flexibility of that finger, compared to fingers 1 and 2, which are flanked by the 
helix-capping sequence T G E K/R K/P (SEQ ID NO: 16). Such flexibility can be beneficial 
when the third finger is modified to contain a non-C2H2 structure. Finally, several amino 
acids were removed from the N-terminus of F1. The resulting zinc finger backbones had the 
following sequences: 

KSKGHECPICFRVFKXXXXXXXHKRSHTGEKP (SEQ ID NO: 17) 
YKCTVCGKSFSXXXXXXXHKRLHTGEKP (SEQ ID NO: 18) 
FSCNYCQRKFGXXXXXXXHVRIHQNKK (SEQ ID NO: 19) 

Amino acid residues denoted by X, present in the recognition portion of these zinc 
fingers, are designed or selected depending upon the desired target site, according to methods 
disclosed, for example, in co-owned WO 00/41566 and WO 00/42219, and/or references cited 
therein. 
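
The backbone modifications described in this section can be written out as simple string operations. This is a minimal sketch under stated assumptions: the function names are hypothetical, the input sequences are SEQ ID NOs: 12 and 14 as read from section A, and the number of residues trimmed from F1 (two) is inferred by comparing SEQ ID NO: 12 with SEQ ID NO: 17 rather than stated in the text.

```python
HELIX = "X" * 7  # placeholder for recognition-helix positions -1 through +6


def modify_f3(f3: str) -> str:
    """Replace the residue at position -2 with G and append QNKK."""
    start = f3.index(HELIX)  # index of helix position -1
    return f3[:start - 1] + "G" + f3[start:] + "QNKK"


def trim_n_terminus(seq: str, n: int) -> str:
    """Remove n residues from the N-terminus."""
    return seq[n:]


F1 = "KKKSKGHECPICFRVFKXXXXXXXHKRSHTGEKP"  # SEQ ID NO: 12
F3 = "FSCNYCQRKFYXXXXXXXHVRIH"             # SEQ ID NO: 14

print(trim_n_terminus(F1, 2))  # modified F1 backbone
print(modify_f3(F3))           # modified F3 backbone
```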

C. Nucleic acid sequences encoding backbones for modified plant ZFPs 

The following polynucleotide sequences are used for design of a three-finger plant ZFP 
that contains the F1, F2 and F3 backbones described above. Polynucleotides encoding multi- 
finger ZFPs are designed according to an overlapping oligonucleotide method as described in, 
for example, co-owned WO 00/41566 and WO 00/42219. Oligonucleotides H1, H2 and H3 
(below) comprise sequences corresponding to the reverse complement of the recognition helices 
of fingers 1-3 respectively; accordingly, nucleotides denoted by N will vary depending upon the 
desired amino acid sequences of the recognition helices, which, in turn, will depend upon the 
nucleotide sequence of the target site. Oligonucleotides PB1, PB2 and PB3 encode the beta- 
sheet portions of the zinc fingers, which are common to all constructs. Codons used frequently 
in Arabidopsis and E. coli were selected for use in these oligonucleotides. 

H1: 
5'-CTC ACC GGT GTG AGA ACG CTT GTG NNN NNN NNN NNN NNN NNN NNN CTT 
GAA AAC ACG GAA-3' 
(SEQ ID NO:20) 

H2: 
5'-TTC ACC AGT ATG AAG ACG CTT ATG NNN NNN NNN NNN NNN NNN NNN AGA 
AAA AGA CTT ACC-3' 
(SEQ ID NO:21) 

H3: 
5'-CTT CTT GTT CTG GTG GAT ACG CAC GTG NNN NNN NNN NNN NNN NNN NNN 
ACC GAA CTT ACG CTG-3' 
(SEQ ID NO:22) 

PB1: 
5'-AAGTCTAAGGGTCACGAGTGCCCAATCTGCTTCCGTGT 
(SEQ ID NO:23) 

PB2: 
5'-TCTCACACCGGTGAGAAGCCATACAAGTGCACTGTTTGTGGTAAGTCT 
(SEQ ID NO:24) 

PB3: 
5'-CTTCATACTGGTGAAAAGCCATTCTCTTGCAACTACTGCCAGCGT 
(SEQ ID NO:25) 

Briefly, these six oligonucleotides are annealed and amplified by polymerase chain 
reaction. The initial amplification product is reamplified using primers that are complementary 
to the initial amplification product and that also contain 5 ' extensions containing restriction 
enzyme recognition sites, to facilitate cloning. The second amplification product is inserted into 
a vector containing, for example, one or more functional domains, nuclear localization 
sequences, and/or epitope tags. See, for example, co-owned WO 00/41566 and WO 00/42219. 
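
The relationship between the oligonucleotides and the backbones can be checked by translation. The sketch below is an illustration, not part of the disclosed method: it uses the standard genetic code, treats any codon containing N as an unspecified residue X, and takes the H2 and PB2 sequences as read from the listing above. The function names are assumptions of this sketch.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement; N stays N."""
    return seq.translate(str.maketrans("ACGTN", "TGCAN"))[::-1]


def translate(seq: str) -> str:
    """Translate in frame from the first base; codons containing N become X."""
    usable = len(seq) - len(seq) % 3
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


H2 = "TTCACCAGTATGAAGACGCTTATG" + "N" * 21 + "AGAAAAAGACTTACC"
PB2 = "TCTCACACCGGTGAGAAGCCATACAAGTGCACTGTTTGTGGTAAGTCT"

print(translate(reverse_complement(H2)))  # helix of finger 2 with its flanks
print(translate(PB2))                      # end of finger 1 and start of finger 2
```

Read this way, the reverse complement of H2 encodes the helix-flanking residues of the F2 backbone, and PB2 bridges the C-terminal end of finger 1 to the beta-sheet portion of finger 2.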



Example 2: Construction of a polynucleotide encoding a modified plant zinc finger 
protein for binding to a predetermined target sequence 

A modified plant zinc finger protein was designed to recognize the target sequence 
5'-GAGGGGGCG-3' (SEQ ID NO:26). Recognition helix sequences for F1, F2 and F3 were 
determined, as shown in Table 1, and oligonucleotides corresponding to H1, H2 and H3 above, 
also including sequences encoding these recognition helices, were used for PCR assembly as 
described above. 



Table 1 

Finger  Target  Helix sequence            Nucleotide sequence for PCR assembly 

F1      GCG     RSDELTR (SEQ ID NO:27)    5'-CTCACCGGTGTGAGAACGCTTGTGACGGGTCAACT 
                                          CGTCAGAACGCTTGAAAACACGGAA-3' (SEQ ID NO:28) 

F2      GGG     RSDHLTR (SEQ ID NO:29)    5'-TTCACCAGTATGAAGACGCTTATGACGGGTCAAGT 
                                          GGTCAGAACGAGAAAAAGACTTACC-3' (SEQ ID NO:30) 

F3      GAG     RSDNLTR (SEQ ID NO:31)    5'-CTTCTTGTTCTGGTGGATACGCACGTGACGGGTCA 
                                          AGTTGTCAGAACGACCGAACTTACGCTG-3' (SEQ ID NO:32) 

Subsequent to the initial amplification, a secondary amplification was conducted, as 
described above, using the following primers: 

PZF: 5'-CGGGGTACCAGGTAAGTCTAAGGGTCAC (SEQ ID NO:33) 

PZR: 5'-GCGCGGATCCACCCTTCTTGTTCTGGTGGATA (SEQ ID NO:34). 

PZF includes a KpnI site (underlined) and overlaps the PB1 sequence (overlap indicated 
in bold). PZR includes a BamHI site (underlined) and overlaps with H3 (indicated in bold). 
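
The stated features of the two primers can be confirmed with a short check. This is a minimal sketch: the recognition sequences GGTACC (KpnI) and GGATCC (BamHI) are the standard ones, the primer strings are the sequences as read from the listing above (the 3' end of PZR is uncertain in the source), and the helper name sites_in is an assumption of this sketch.

```python
SITES = {"KpnI": "GGTACC", "BamHI": "GGATCC"}

PZF = "CGGGGTACCAGGTAAGTCTAAGGGTCAC"
PZR = "GCGCGGATCCACCCTTCTTGTTCTGGTGGATA"
PB1_START = "AAGTCTAAGGGTCAC"     # first 15 nucleotides of PB1
H3_START = "CTTCTTGTTCTGGTGGATA"  # first 19 nucleotides of H3


def sites_in(seq: str) -> list:
    """Names of the restriction sites whose recognition sequence occurs in seq."""
    return [name for name, site in SITES.items() if site in seq]


# Each primer carries one site in its 5' extension and ends in the overlap region.
print(sites_in(PZF), PZF.endswith(PB1_START))
print(sites_in(PZR), PZR.endswith(H3_START))
```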

The secondary amplification product is digested with KpnI and BamHI and inserted into 
an appropriate vector (e.g., YCF3, whose construction is described below) to construct an 
expression vector encoding a modified plant ZFP fused to a functional domain, for modulation of 
gene expression in plant cells. 
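
The design steps of this example can be sketched in code. The sketch is illustrative and rests on labeled assumptions: the triplet-to-helix assignments and the codons are simply those that can be read back from Table 1 (they are not a general recognition code), the flanking sequences are those of H1, H2 and H3 in Example 1, and the function names are hypothetical.

```python
# Triplet-to-helix assignments taken from Table 1 (assumption: only these three).
HELICES = {"GCG": "RSDELTR", "GGG": "RSDHLTR", "GAG": "RSDNLTR"}

# Codons read back from the Table 1 oligonucleotides (assumption of this sketch).
CODONS = {"R": "CGT", "S": "TCT", "D": "GAC", "E": "GAG",
          "H": "CAC", "N": "AAC", "L": "TTG", "T": "ACC"}

# Constant 5' and 3' flanks of H1, H2 and H3 around the 21 variable nucleotides.
FLANKS = {
    "F1": ("CTCACCGGTGTGAGAACGCTTGTG", "CTTGAAAACACGGAA"),
    "F2": ("TTCACCAGTATGAAGACGCTTATG", "AGAAAAAGACTTACC"),
    "F3": ("CTTCTTGTTCTGGTGGATACGCACGTG", "ACCGAACTTACGCTG"),
}


def reverse_complement(seq: str) -> str:
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def helix_oligo(finger: str, triplet: str) -> str:
    """Antisense oligonucleotide carrying the helix chosen for one triplet."""
    coding = "".join(CODONS[aa] for aa in HELICES[triplet])
    five_prime, three_prime = FLANKS[finger]
    return five_prime + reverse_complement(coding) + three_prime


def design(target: str) -> dict:
    """Assign the 5'-most triplet to F3 and the 3'-most triplet to F1."""
    triplets = [target[i:i + 3] for i in range(0, 9, 3)]
    return {f: helix_oligo(f, t) for f, t in zip(["F3", "F2", "F1"], triplets)}


for finger, oligo in sorted(design("GAGGGGGCG").items()):
    print(finger, oligo)
```

With the target 5'-GAGGGGGCG-3', the F1 and F2 oligonucleotides produced this way match the sequences listed in Table 1.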

Example 3: Construction of Vectors for Expression of Modified Plant ZFPs 
YCF3 was generated as shown schematically in Figure 1. The starting construct was a
plasmid containing a CMV promoter, an SV40 nuclear localization sequence (NLS), a ZFP
DNA binding domain, a Herpesvirus VP16 transcriptional activation domain and a FLAG
epitope tag (pSB5186-NVF). This construct was digested with SpeI to remove the CMV
promoter. The larger fragment was gel-purified and self-ligated to make a plasmid termed
GF1. GF1 was then digested with KpnI and HindIII, releasing sequences encoding the ZFP
domain, the VP16 activation domain, and the FLAG epitope tag. The larger fragment was then
ligated to a KpnI/HindIII fragment containing sequences encoding a ZFP binding domain and
a VP16 activation domain, to generate a plasmid named GF2. This resulted in deletion of
sequences encoding the FLAG tag from the construct.

GF2 was digested with BamHI and HindIII, releasing a small fragment encoding the
VP16 activation domain, and the larger fragment was purified and ligated to a BamHI/HindIII
digested PCR fragment containing the maize C1 activation domain (Goff et al. (1990) EMBO
J. 9:2517-2522) (KpnI and HindIII sites were introduced into the PCR fragment through KpnI
and HindIII site-containing primers) to generate NCF1. A PCR fragment containing a maize
Opaque-2 NLS was digested with SpeI/KpnI and ligated to the larger fragment from
KpnI/SpeI digested NCF1 to produce YCF2. YCF2 was then digested with MluI and SpeI
and the larger fragment was ligated to an MluI and SpeI digested PCR fragment containing
the plant-derived CaMV 35S promoter (MluI and SpeI sites were introduced into the PCR
fragment through MluI or SpeI site-containing primers) to generate the YCF3 vector.
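The chain of intermediate plasmids described above can be summarized as a small data structure. The following Python sketch is a bookkeeping illustration only, with the plasmid names and enzymes taken from this Example; the Step class, its field names and the lineage function are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    product: str    # plasmid obtained in this step
    parent: str     # plasmid that was digested
    enzymes: tuple  # enzymes used to digest the parent
    change: str     # short description of what changed


STEPS = [
    Step("GF1", "pSB5186-NVF", ("SpeI",),
         "CMV promoter removed; larger fragment self-ligated"),
    Step("GF2", "GF1", ("KpnI", "HindIII"),
         "ZFP/VP16/FLAG cassette replaced by ZFP/VP16 fragment"),
    Step("NCF1", "GF2", ("BamHI", "HindIII"),
         "VP16 fragment replaced by maize C1 activation domain"),
    Step("YCF2", "NCF1", ("KpnI", "SpeI"),
         "maize Opaque-2 NLS fragment ligated in"),
    Step("YCF3", "YCF2", ("MluI", "SpeI"),
         "CaMV 35S promoter fragment ligated in"),
]


def lineage(vector: str) -> list:
    """Return the plasmids leading to `vector`, starting construct first."""
    by_product = {step.product: step for step in STEPS}
    chain = [vector]
    while chain[-1] in by_product:
        chain.append(by_product[chain[-1]].parent)
    return chain[::-1]


if __name__ == "__main__":
    print(" -> ".join(lineage("YCF3")))
```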

Sequences encoding modified plant ZFP binding domains can be inserted, as
KpnI/BamHI fragments, into KpnI/BamHI-digested YCF3 to generate constructs encoding
ZFP-functional domain fusion proteins for modulation of gene expression in plant cells. For
example, a series of modified plant ZFP domains, described in Example 4 infra, were inserted
into KpnI/BamHI-digested YCF3 to generate expression vectors encoding modified plant
ZFP-activation domain fusion polypeptides that enhance expression of the Arabidopsis
thaliana GMT gene.

Example 4: Modified Plant ZFP Designs for Regulation of an Arabidopsis
thaliana gamma tocopherol methyltransferase (GMT) Gene

Modified plant zinc finger proteins were designed to recognize various target
sequences in the Arabidopsis GMT gene (GenBank Accession Number AAD38271). Table 2
shows the nucleotide sequences of the various GMT target sites, and the amino acid sequences
of zinc fingers that recognize the target sites. Sequences encoding these binding domains
were prepared as described in Example 1 and inserted into YCF3 as described in Example 3.

Table 2

ZFP #  Target                     F1                      F2                      F3
1      GTGGACGAGT (SEQ ID NO:35)  RSDNLAR (SEQ ID NO:36)  DRSNLTR (SEQ ID NO:37)  RSDALTR (SEQ ID NO:38)
2      CGGGATGGGT (SEQ ID NO:39)  RSDHLAR (SEQ ID NO:40)  TSGNLVR (SEQ ID NO:41)  RSDHLRE (SEQ ID NO:42)
3      TGGTGGGTGT (SEQ ID NO:43)  RSDALTR (SEQ ID NO:44)  RSDHLTT (SEQ ID NO:45)  RSDHLTT (SEQ ID NO:46)
4      GAAGAGGATT (SEQ ID NO:47)  QSSNLAR (SEQ ID NO:48)  RSDNLAR (SEQ ID NO:49)  QSGNLTR
5      GAGGAAGGGG (SEQ ID NO:51)  RSDHLAR (SEQ ID NO:52)  QSGNLAR (SEQ ID NO:53)  RSDNLTR
6      TGGGTAGTC (SEQ ID NO:55)   ERGTLAR (SEQ ID NO:56)  QSGSLTR (SEQ ID NO:57)  RSDHLTT
7      GGGGAAAGGG (SEQ ID NO:59)  RSDHLTQ (SEQ ID NO:60)  QSGNLAR (SEQ ID NO:61)  RSDHLSR
8      GAAGAGGGTG (SEQ ID NO:63)  QSSHLAR (SEQ ID NO:64)  RSDNLAR (SEQ ID NO:65)  QSGNLAR
9      GAGGAGGATG (SEQ ID NO:67)  QSSNLQR (SEQ ID NO:68)  RSDNALR (SEQ ID NO:69)  RSDNLQR
10     GAGGAGGAGG (SEQ ID NO:71)  RSDNALR (SEQ ID NO:72)  RSDNLAR (SEQ ID NO:73)  RSDNLTR
11     GTGGCGGCTG (SEQ ID NO:75)  QSSDLRR (SEQ ID NO:76)  RSDELQR (SEQ ID NO:77)  RSDALTR (SEQ ID NO:78)
12     TGGGGAGAT (SEQ ID NO:79)   QSSNLAR (SEQ ID NO:80)  QSGHLQR (SEQ ID NO:81)  RSDHLTT (SEQ ID NO:82)
13     GAGGAAGCT (SEQ ID NO:83)   QSSDLRR (SEQ ID NO:84)  QSGNLAR (SEQ ID NO:85)  RSDNLTR (SEQ ID NO:86)
14     GCTTGTGGCT (SEQ ID NO:87)  DRSHLTR (SEQ ID NO:88)  TSGHLTT (SEQ ID NO:89)  QSSDLTR (SEQ ID NO:90)
15     GTAGTGGATG (SEQ ID NO:91)  QSSNLAR (SEQ ID NO:92)  RSDALSR (SEQ ID NO:93)  QSGSLTR (SEQ ID NO:94)
16     GTGTGGGATT (SEQ ID NO:95)  QSSNLAR (SEQ ID NO:96)  RSDHLTT (SEQ ID NO:97)  RSDALTR (SEQ ID NO:98)
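The correspondence between a target site and its three fingers in Table 2 can be made explicit by splitting the site into triplets. The following Python sketch is an illustration only. It assumes that F1 contacts the 3'-most triplet of the first nine bases of the target (written 5' to 3') and that F3 contacts the 5'-most triplet; this convention is not stated in this Example, and the name assign_fingers is hypothetical.

```python
def assign_fingers(target: str, helices: list) -> list:
    """Pair F1..Fn with triplets read from the 3' end of the target core.

    `target` is written 5' to 3'. Only the first 3 * len(helices) bases
    are used, so a tenth base, where present, is ignored.
    """
    core = target[:3 * len(helices)]
    triplets = [core[i:i + 3] for i in range(0, len(core), 3)]
    return [
        (f"F{n}", triplet, helix)
        for n, (triplet, helix) in enumerate(zip(reversed(triplets), helices), start=1)
    ]


if __name__ == "__main__":
    # ZFP #1 from Table 2: target site and the F1, F2, F3 helix sequences.
    for finger, triplet, helix in assign_fingers(
        "GTGGACGAGT", ["RSDNLAR", "DRSNLTR", "RSDALTR"]
    ):
        print(finger, triplet, helix)
```

For ZFP #1 this pairs F1 with GAG, F2 with GAC and F3 with GTG.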

Example 5: Modulation of Expression of an Arabidopsis thaliana gamma
tocopherol methyltransferase (GMT) Gene

Arabidopsis thaliana protoplasts were prepared and transfected with plasmids
encoding modified ZFP-activation domain fusion polypeptides. Preparation of protoplasts
and polyethylene glycol-mediated transfection were performed as described by Abel et al.
(1994) Plant Journal 5:421-427. The different plasmids contained the modified plant ZFP
binding domains described in Table 2, inserted as KpnI/BamHI fragments into YCF3.

At 18 hours after transfection, RNA was isolated from transfected protoplasts, using
an RNA extraction kit from Qiagen (Valencia, CA) according to the manufacturer's
instructions. The RNA was then treated with DNase (RNase-free), and analyzed for GMT
mRNA content by real-time PCR (TaqMan®). Table 3 shows the sequences of the primers
and probe used for TaqMan® analysis. Results for GMT mRNA levels were normalized to
levels of 18S rRNA. These normalized results are shown in Figure 2 as fold-activation of
GMT mRNA levels, compared to protoplasts transfected with carrier DNA (denoted "No
ZFP" in Figure 2). The results indicate that expression of the GMT gene was enhanced in
protoplasts that were transfected with plasmids encoding fusions between a transcriptional
activation domain and a modified plant ZFP binding domain targeted to the GMT gene.
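
The normalization and fold-activation calculation described above can be written out as a short computation. The following Python sketch is an illustration only. It assumes that the GMT and 18S signals are available as relative quantities on a linear scale; the function names and the numerical values are hypothetical and are not data from Figure 2.

```python
def normalized_level(gmt_signal: float, rrna_signal: float) -> float:
    """GMT mRNA level normalized to the 18S rRNA level of the same sample."""
    return gmt_signal / rrna_signal


def fold_activation(sample: tuple, control: tuple) -> float:
    """Normalized GMT level of a sample relative to the "No ZFP" control.

    Each argument is a (GMT signal, 18S rRNA signal) pair.
    """
    return normalized_level(*sample) / normalized_level(*control)


if __name__ == "__main__":
    # Hypothetical relative quantities: (GMT mRNA, 18S rRNA).
    no_zfp = (1.0, 2.0)
    zfp_sample = (5.0, 4.0)
    print(fold_activation(zfp_sample, no_zfp))
```

By construction the control itself has a fold-activation of 1, so values above 1 indicate enhanced GMT expression relative to the "No ZFP" sample.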



Table 3

                    SEQUENCE
GMT forward primer  5'-AATGATCTCGCGGCTGCT-3' (SEQ ID NO:99)
GMT reverse primer  5'-GAATGGCTGATCCAACGCAT-3' (SEQ ID NO:100)
GMT probe           5'-TCACTCGCTCATAAGGCITCCTrCCAAGT-3' (SEQ ID NO:101)
18S forward primer  5'-TGCAACAAACCCCGACnTATG-3' (SEQ ID NO:102)
18S reverse primer  5'-CCCGCGTCGACCmTATC-3' (SEQ ID NO:103)
18S probe           5'-AATAAATGCGTCCCTT-3' (SEQ ID NO:104)



Although the foregoing methods and compositions have been described in detail for 
purposes of clarity of understanding, certain modifications, as known to those of skill in the 
art, can be practiced within the scope of the appended claims. 

1. A method for modulating gene expression in a plant cell; the method
comprising contacting the cell with a modified plant zinc finger protein (ZFP) comprising a
tandem array of zinc fingers.

2. The method of claim 1, wherein the protein is expressed in the cell.

3. The method of claim 2, wherein a nucleic acid encoding the protein is
introduced into the cell.

4. The method of any of claims 1 to 3, wherein one or more zinc fingers of the
modified plant ZFP comprises an adapted amino acid sequence at any one or more of residues
-1 through +6 of the recognition helix.

5. The method of claim 4, wherein the adapted amino acid sequence is obtained
by rational design.

6. The method of claim 4, wherein the adapted amino acid sequence is obtained
by selection.

7. The method of claim 6, wherein selection is accomplished through the use of a
method selected from the group consisting of phage display, interaction trap, ribosome display
and RNA-peptide fusion.

8. The method of any of claims 1 to 7, wherein the modified plant ZFP comprises
a zinc finger backbone derived from two or more different species.

9. The method of any of claims 1 to 7, wherein the modified plant ZFP comprises
a zinc finger backbone derived from two or more different plant species.

10. The method of any of claims 1 to 7, wherein the modified plant ZFP comprises
zinc finger backbones of fungal origin.

11. The method of any of claims 1 to 10, wherein one or more amino acid residues 
between one or more of the zinc fingers is deleted. 

12. A method for modulating gene expression in a plant cell; the method
comprising contacting the cell with a fusion polypeptide comprising (i) a modified plant zinc
finger protein (ZFP) comprising a tandem array of zinc fingers and (ii) a functional domain.

13. The method of claim 12, wherein the functional domain is a repressive domain.

14. The method of claim 12, wherein the functional domain is an activation
domain.

15. A method for producing a plant having an altered phenotype relative to the 
wild-type plant comprising the following steps: 

introducing a nucleic acid encoding a modified plant zinc finger protein (ZFP) 
comprising a tandem array of zinc fingers into one or more cells of the plant; and 

expressing the nucleic acid in the plant cell such that the plant has an altered 
phenotype relative to the wild-type plant. 

16. The method of claim 15, wherein the phenotype that is altered is selected from
the group consisting of nutritional value, yield, stress tolerance, pathogen resistance,
resistance to agrochemicals, production of pharmaceutical compounds, and production of
industrial chemicals.

17. A modified plant zinc finger protein comprising one or more zinc fingers that
bind to a target site.

18. A fusion polypeptide comprising a modified plant zinc finger protein and at
least one functional domain.

19. A polynucleotide encoding the polypeptide of claim 17 or claim 18. 

20. An expression vector comprising the polynucleotide of claim 19.

21. A host cell comprising the polynucleotide of claim 19.